WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 7:

A01H 5/00, C07H 21/04, C12N 5/14, 15/29, 15/52, 15/82

A1



(11) International Publication Number: WO 00/67558

(43) International Publication Date: 16 November 2000 (16.11.00)

(21) International Application Number: PCT/US00/12450

(22) International Filing Date: 5 May 2000 (05.05.00)

(30) Priority Data:
60/132,919    6 May 1999 (06.05.99)    US



(71)(72) Applicant and Inventor: TIMKO, Michael [US/US]; 1610
Old Ballard Road, Charlottesville, VA 22901 (US). 

(74) Agent: HANSEN, Christine, M.; Connolly Bove Lodge & Hutz 
LLP, 1210 Market Street, P.O. Box 2207, Wilmington, DE 
19899 (US). 



(81) Designated States: AE, AG, AL, AM, AT, AU, AZ, BA, BB,
BG, BR, BY, CA, CH, CN, CR, CU, CZ, DE, DK, DM, 
DZ, EE, ES, FI, GB, GD, GE, GH, GM, HR, HU, ID, IL, 
IN, IS, JP, KE, KG, KP, KR, KZ, LC, LK, LR, LS, LT, LU, 
LV, MA, MD, MG, MK, MN, MW, MX, NO, NZ, PL, PT, 
RO, RU, SD, SE, SG, SI, SK, SL, TJ, TM, TR, TT, TZ, 
UA, UG, US, UZ, VN, YU, ZA, ZW, ARIPO patent (GH, 
GM, KE, LS, MW, SD, SL, SZ, TZ, UG, ZW), Eurasian 
patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), European 
patent (AT, BE, CH, CY, DE, DK, ES, FI, FR, GB, GR, 
IE, IT, LU, MC, NL, PT, SE), OAPI patent (BF, BJ, CF, 
CG, CI, CM, GA, GN, GW, ML, MR, NE, SN, TD, TG).



Published

With international search report. 

Before the expiration of the time limit for amending the 
claims and to be republished in the event of the receipt of 
amendments. 



(54) Title: REGULATION OF GENE EXPRESSION IN TOBACCO FOR MANIPULATION OF PLANT GROWTH AND SECONDARY 
METABOLISM 



(57) Abstract 

This invention relates to enzymes involved in alkaloid, and specifically nicotine, formation in tobacco plants. The invention is based, 
at least in part, on the nucleotide sequences encoding four variants of putrescine N-methyltransferase (PMT1, PMT2, PMT3, and PMT4), 
two variants of arginine decarboxylase (ADC1 and ADC2), ornithine decarboxylase (ODC), S-adenosylmethionine synthetase (SAMS), a
fragment of NADH dehydrogenase, and a fragment of phosphoribosylanthranilate isomerase. The invention also relates to proteins expressed 
by these nucleotides, promoter regions of these nucleotides, use of these promoter regions to culture transgenic plant cells and to produce 
transgenic plants, sense and antisense nucleotides complementary to all or portions of these nucleotide sequences, use of sense and antisense 
nucleotides to regulate gene expression, and assays using proteins involved in alkaloid formation in tobacco plants. 



REGULATION OF GENE EXPRESSION IN TOBACCO FOR MANIPULATION OF 
PLANT GROWTH AND SECONDARY METABOLISM 

CROSS-REFERENCE TO RELATED APPLICATIONS 
This application is a continuation-in-part of US Patent Application Ser. No. 60/132,919, filed
May 6, 1999, now abandoned, which is hereby incorporated by reference in its entirety herein. 

FIELD OF THE INVENTION 
This invention relates to enzymes involved in alkaloid, and specifically nicotine, formation in 
tobacco plants. The invention is based, at least in part, on the nucleotide sequences encoding four 
variants of putrescine N-methyltransferase (PMT1, PMT2, PMT3, and PMT4), two variants of 
arginine decarboxylase (ADC1 and ADC2), ornithine decarboxylase (ODC), S-adenosylmethionine
synthetase (SAMS), a fragment of NADH dehydrogenase, and a fragment of 
phosphoribosylanthranilate isomerase. The invention also relates to proteins expressed by these 
nucleotides, promoter regions of these nucleotides, use of these promoter regions to culture 
transgenic plant cells and to produce transgenic plants, sense and antisense nucleotides 
complementary to all or portions of these nucleotide sequences, use of sense and antisense 
nucleotides to regulate gene expression, and assays using proteins involved in alkaloid formation in 
tobacco plants. 

BACKGROUND OF THE INVENTION 

I. Alkaloid Formation 

Alkaloids are one of the most diverse groups of secondary compounds found in plants and 
they are the product of a complex biosynthesis pathway (Hashimoto and Yamada, 1994; Chou and 
Kutchan, 1998; Waterman, 1998). Why plants accumulate these compounds and in so many 
different forms is not known. Moreover, for many alkaloids, the exact site of synthesis and the 
factors that control their intercellular distribution and accumulation remain to be determined 
(Hashimoto and Yamada, 1994; Kutchan, 1995; Chou and Kutchan, 1998). 

Nicotine is the most abundant alkaloid present in cultivated tobacco. Nicotine is formed 
primarily in the roots of the tobacco plant and subsequently is transported to the leaves, where it is 
stored (Tso, Physiology and Biochemistry of Tobacco Plants, pp. 233-34, Dowden, Hutchinson & 
Ross, Stroudsburg, Pa. (1972)). 

The synthesis and accumulation of nicotine and other tobacco alkaloids are known to be
controlled by various developmental, environmental, and chemical cues. Changes in phytohormone
(e.g., auxin, cytokinin) levels and/or ratios as a consequence of developmental age (Hashimoto and
Yamada, 1994; Kutchan, 1995) or by direct manipulation of plant cell culture conditions have been
shown to affect the synthesis and accumulation of nicotine and various tobacco alkaloids (Hashimoto
and Yamada, 1994; Hibi et al., 1994; Eilbert, 1998). Various abiotic factors (wounding, drought
stress, pH imbalance, etc.) (Hashimoto and Yamada, 1994; Kutchan, 1998; Waterman, 1998),
as well as biotic factors, such as herbivory, insect feeding, and attack by various microbial and fungal
pathogens, are known to elicit increased production of nicotine and other alkaloids in the leaves of wild
and cultivated tobacco species (Baldwin, 1989; Saito and Murakoishi, 1998; Baldwin and Prestin,
1999). In addition, the commercial practice of topping (i.e., removal of the flowering head and young
leaves at the upper portions of the plant) results in increases in nicotine and in the amount and
complexity of total alkaloids present in the leaves of Nicotiana tabacum (Hashimoto and Yamada,
1994; Hibi et al., 1994). The factors controlling the topping-induced increase in alkaloid
biosynthesis are not known, but likely involve a complex physiological response in the plant as a
result of altered phytohormones and wound-induced signaling (Akehurst, 1981; Hibi et al., 1994;
Kutchan, 1998). In this regard, considerable evidence now exists indicating that a jasmonic acid
(JA)-mediated signal transduction pathway may play a role in regulation of gene expression
contributing to this increase in alkaloid biosynthesis (Baldwin et al., 1994, 1996, 1997; Ohnmeiss et
al., 1997; Imanishi et al., 1998a, 1998b).

The nicotine molecule is composed of two heterocyclic rings, a pyridine moiety and a
pyrrolidine moiety, each of which is derived from a separate biochemical pathway. The pyridine
moiety of nicotine is derived from nicotinic acid. The pyrrolidine moiety of nicotine is provided
through a pathway leading from putrescine to N-methylputrescine and then to N-methylpyrroline
(Goodwin and Mercer, Introduction to Plant Biochemistry, pp. 488-91, Pergamon Press, New York
(1983)).

Putrescine is formed in plants by one of two pathways (Chattopadhyay and Ghosh, 1998). It
can be synthesized directly from ornithine, in a reaction catalyzed by the enzyme ornithine
decarboxylase (ODC, EC 4.1.1.17), or formed indirectly from arginine in a reaction sequence
initiated by arginine decarboxylase (ADC, EC 4.1.1.19). Putrescine formed by the ADC and/or ODC
pathway serves as a precursor in the synthesis of the higher polyamines, spermine and spermidine,
catalyzed by the enzymes spermine synthase and spermidine synthase, respectively, or it is converted
to N-methylputrescine by the action of putrescine N-methyltransferase (PMT), the first committed
step in nicotine biosynthesis (Hashimoto and Yamada, 1994; Kutchan, 1995; Chattopadhyay and
Ghosh, 1998). N-methylputrescine is oxidized by a diamine oxidase and cyclized to form the
1-methyl-Δ1-pyrrolinium cation, which is condensed with nicotinic acid or its derivative to form nicotine
(Hashimoto and Yamada, 1994).

Putrescine is a precursor for N-methylputrescine, which then forms N-methylpyrroline.
Conversion of putrescine to N-methylputrescine is catalyzed by the enzyme putrescine 
N-methyltransferase ("PMT"), with S-adenosylmethionine serving as the methyl group donor. PMT 
appears to be the rate-limiting enzyme in the pathway supplying N-methylpyrroline for nicotine 
synthesis in tobacco (Feth et al., "Regulation in Tobacco Callus of Enzyme Activities of the Nicotine 
Pathway", Planta, 168, pp. 402-07 (1986); Wagner et al., "The Regulation of Enzyme Activities of 
the Nicotine Pathway in Tobacco", Physiol. Plant., 68, pp. 667-72 (1986)). 
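
The route from the amino acid precursors to nicotine described above can be summarized as a small directed graph. The following Python sketch is illustrative only: the dictionary layout, the function name `routes`, and the step labels are assumptions made for this example and are not part of the disclosure. The arginine branch is collapsed into a single ADC-initiated edge, and the polyamine (spermidine/spermine) branch is omitted.

```python
# Illustrative sketch only: each key is a metabolite and each value is a list
# of (step label, product) pairs taken from the pathway description above.
PATHWAY = {
    "ornithine": [("ODC", "putrescine")],
    # The text describes a reaction sequence initiated by ADC; the
    # intermediate steps are collapsed into one edge here (assumption).
    "arginine": [("ADC (initiating step)", "putrescine")],
    "putrescine": [("PMT", "N-methylputrescine")],
    "N-methylputrescine": [("diamine oxidase, cyclization", "N-methylpyrroline")],
    "N-methylpyrroline": [("condensation with nicotinic acid", "nicotine")],
}


def routes(start, goal, pathway=PATHWAY):
    """Return every route from start to goal as a list of step-label lists."""
    if start == goal:
        return [[]]
    found = []
    for step, product in pathway.get(start, []):
        for tail in routes(product, goal, pathway):
            found.append([step] + tail)
    return found


if __name__ == "__main__":
    for precursor in ("ornithine", "arginine"):
        for route in routes(precursor, "nicotine"):
            print(precursor, "->", " -> ".join(route))
```

In this representation PMT appears on every route from either precursor to nicotine, which is consistent with its description above as the first committed step.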

II. Transgenic Plants

The methods of nicotine formation in tobacco and the genes involved have been studied both 
to better understand differential gene expression during tobacco growth and development, and also to 
discover tools useful for creating transgenic plants. For example, the regulatory sequences that 
modify protein expression in tobacco may be useful in creating transgenic tobacco or other 
transgenic plants.

It has already been demonstrated that tissues of many plant species may be transformed by
exogenous, typically chimeric, genes which are effective to stably transform cells of the tissues. For
several species, tissues transformed in this fashion may be regenerated to give rise to whole
transgenic or genetically engineered plants. The engineered traits introduced into the transgenic
plants by these techniques have proven to be stable and have also proven to be transmissible through
normal Mendelian inheritance to the progeny of the regenerated plants. One such desirable trait is
the in vivo production of desired gene products in the cells of the transgenic plants.
For a chimeric gene to be effective, the foreign DNA sequence containing a coding region should be
flanked by appropriate promotion and control regions. Commonly used plant cell transcription
promoters include the nopaline synthase promoter from the T-DNA of A. tumefaciens and the 35S
promoter from the cauliflower mosaic virus.

In order for the newly inserted chimeric gene to express the protein for which it codes in the
plant cell, the proper regulatory signals must be present and in the proper location with respect to the
gene. These regulatory signals include a promoter region, a 5' non-translated leader sequence and a 3'
polyadenylation sequence. A promoter is a DNA sequence that directs the cellular machinery of a
plant to produce RNA from the contiguous structural coding sequence downstream (3') to the
promoter. The promoter region influences the rate at which the RNA product of the gene and the
resultant protein product of the gene are made. The 3' polyadenylation signal is a non-translated region
that functions in the plant cells to cause the addition of polyadenylate nucleotides to the 3' end of the
RNA to enable the mRNA to be transported to the cytoplasm and to stabilize the mRNA for subsequent
translation of the RNA to produce protein.
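
The ordering requirement stated above (promoter, 5' non-translated leader, coding sequence, 3' polyadenylation sequence) can be written down as a simple check. This is a minimal sketch under stated assumptions: the `Element` class, the `kind` labels, and the element names in the usage note are hypothetical placeholders, not constructs from this disclosure.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Element:
    kind: str  # one of: "promoter", "5_leader", "coding", "3_polyA" (labels are assumptions)
    name: str  # free-text description of the element


# 5'-to-3' order of the regulatory signals and coding region named in the text.
REQUIRED_ORDER = ("promoter", "5_leader", "coding", "3_polyA")


def is_well_formed(cassette):
    """True if the cassette holds exactly the required elements in 5'->3' order."""
    return tuple(element.kind for element in cassette) == REQUIRED_ORDER


def example_cassette():
    """A hypothetical cassette used only to exercise the check."""
    return [
        Element("promoter", "placeholder promoter"),
        Element("5_leader", "placeholder leader"),
        Element("coding", "placeholder coding region"),
        Element("3_polyA", "placeholder polyadenylation signal"),
    ]
```

Calling `is_well_formed(example_cassette())` returns `True`, while the same elements in reversed order fail the check. The sketch tests only element order, not biological function.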

Other plant cell transformation techniques are directed toward the direct insertion of DNA
into the cytoplasm of plant cells from which it is taken up, by an uncharacterized mechanism, into
the genome of the plant. One such technique is electroporation, in which electric shock causes
disruption of the cellular membranes of individual plant cells. Plant protoplasts in aqueous
solution, when subjected to electroporation, will take up DNA from the surrounding medium. Another
technique involves the physical acceleration of DNA, coated onto small inert particles, either into
regenerable plant tissues or into plant germline cells.

The availability of cloned nucleic acid sequences encoding an enzyme involved in alkaloid 
synthesis allows for the potential manipulation of alkaloid contents. Furthermore, the availability of 
promoters useful for expressing genes in plants allows for the creation of chimeric molecules and 
transgenic plants, which in turn result in possible native plant production of desirable proteins. 

Previously reported work discloses cloning nucleotides encoding proteins involved in the
biosynthesis of nicotine, and isolating such proteins. Approximately twenty or more cDNAs and/or
genomic DNA fragments encoding different enzymes involved with alkaloid formation have been
isolated (Chattopadhyay and Ghosh, 1998). For example, successful cloning of partial or full-length
cDNA encoding ODC activity from tobacco was disclosed by Malik et al. (J. Plant Biochem.
& Biotech. 5: 109-112 (1996)). Also, a relatively crude preparation of PMT (30-fold purification) has
been subjected to limited characterization (Mizusaki et al., "Phytochemical Studies on Tobacco
Alkaloids XIV. The Occurrence and Properties of Putrescine N-Methyltransferase in Tobacco
Plants", Plant Cell Physiol., 12, pp. 633-40 (1971)). A process for purifying PMT is disclosed in US
Patent No. 5,369,023, "Method of purifying putrescine n-methyltransferase from tobacco plant
extract with an anion exchange medium", hereby incorporated by reference in its entirety herein.
Several laboratories have reported the cloning of partial or full-length cDNAs encoding ADC (Bell
and Malmberg, 1990; Rostogi et al., 1993; Perez-Amador et al., 1995; Nam et al., 1997; Watson and
Malmberg, 1996). Comparisons of the amino acid sequences of ADC from various plants revealed a
high degree of conservation among the various proteins, as well as homology to ODC (Malmberg et
al., 1998).

It is an object of the present invention to characterize the nucleotide and amino acid
sequences of enzymes involved in the biosynthesis of nicotine in tobacco. It is also an object of the
present invention to provide plant promoter regions that are capable of conferring high levels of
transcription in rapidly dividing cells of transformed plants when coupled with a heterologous coding
sequence in a chimeric gene. Further, the invention is directed to chimeric genes incorporating such
promoter regions, stable transfection of plants with these chimeric genes, and the plants and cells that
are transfected, as well as seeds of such transfected plants. It is a further object to characterize sense
and antisense nucleotides capable of regulating expression of genes encoding enzymes involved
in the biosynthesis of alkaloids.

SUMMARY OF THE INVENTION 
Proteins involved in the biosynthesis of nicotine in tobacco (N. tabacum) are the subject of this
invention. More specifically, the invention concerns four variants of putrescine N-methyltransferase
(PMT1, PMT2, PMT3, and PMT4), two variants of arginine decarboxylase (ADC1 and ADC2),
ornithine decarboxylase (ODC), S-adenosylmethionine synthetase (SAMS), NADH dehydrogenase,
and phosphoribosylanthranilate isomerase.

BRIEF DESCRIPTION OF THE FIGURES 

Figure 1. Genomic DNA gel blot analysis of the PMT gene family in N. tabacum cv. Xanthi.
Total genomic DNA (30 µg) was digested with KpnI, EcoRI, or EcoRI and KpnI, separated by
agarose gel electrophoresis, and transferred to nylon membranes. The membrane was hybridized
with a 32P-labeled antisense strand probe covering the complete coding region of the NtPMT1a
cDNA. Identity of the hybridizing bands as determined by comparison to phage DNA digests is
indicated. Molecular weights are given in kb. Note that KpnI shifts only the NtPMT1b band in the
gel blot because this restriction site is present only in Exon 1 of NtPMT1b and not NtPMT1a.

Figure 2. Amino acid sequence alignment of N. tabacum PMTs. Shown is a PILEUP alignment of
the predicted amino acid sequences of the various tobacco PMTs. Amino acid residues that differ
among the PMTs are shaded. NtPMT1a, NtPMT2, NtPMT3, and NtPMT4 refer to the deduced
amino acid sequences of the PMTs encoded by the NtPMT1a, NtPMT2, NtPMT3, and NtPMT4
genes, respectively, isolated from N. tabacum cv. Xanthi genomic DNA; cNtPMT1a is the predicted
amino acid sequence of the A411 cDNA (Accession No. D28506) isolated from N. tabacum cv.
Burley 21 by Hibi et al. (1994). The location of the exon-intron boundaries is indicated by the dark
vertical line. The nucleotide sequences for NtPMT1a, NtPMT2, NtPMT3, and NtPMT4 appear in
GenBank under the accession numbers AF126810, AF126809, AF126811, and AF126812,
respectively.

Figure 3. Polyacrylamide gel electrophoresis analysis of PCR amplified genomic DNA fragments
encoding Exon 1 of PMT from various species of Nicotiana. PCR amplification was carried out as
described in the Materials and Methods using Exon 1-specific primers 1 and 2 and total genomic
DNA isolated from N. tabacum, N. otophora, and N. tomentosiformis. The amplification products
were separated by electrophoresis on 6.5% polyacrylamide gels, and the gels were fixed and subjected to
autoradiography. The amplification products isolated from N. tabacum cv. Burley 21 and N. tabacum
cv. Xanthi were identical and only the amplification products from the reactions with N. tabacum cv.
Burley 21 DNA are shown. Standards were generated in identical reaction conditions primed with
plasmid DNA encoding the various PMT genes isolated in this study.

Figure 4. Nucleotide sequence alignment of the 5'-flanking regions of the N. tabacum PMT genes.
Shown is a PILEUP alignment of the nucleotide sequences upstream of the initiating methionine
(MET) codon of the four PMT genes isolated from N. tabacum cv. Xanthi. The proposed start site
for transcription of the NtPMT1a gene is indicated by the +1 above the sequences. The TATA-box
and CCAAT-box motifs are boxed. Potential transcriptional regulatory elements identified by
MOTIF search programs are also boxed and indicated by the following abbreviations: PAL:
palindromic sequences; G-Box: G-Box homologous sequences; MRE: metal-responsive
element homolog. Nucleotides identical in three or more sequences are shaded. The polyguanine-rich
region is underlined. Numbering is indicated to the right and is relative to the proposed start site
of each gene.

Figure 5. RNA gel blot analysis of PMT transcript levels in various tissues. Total RNA was isolated
from various tissues of mature N. tabacum cv. Burley 21 and analyzed by gel blot analysis using a
32P-labeled NtPMT1a cDNA coding region (Exons 2 to 8) probe capable of detecting all PMT
transcripts.

A. PMT transcript levels in various tobacco plant tissues and/or organs.

B. Induction of PMT expression in tobacco roots following topping. Abbreviations: HP, wild-type
(Nic1Nic2) Burley 21; LP, low alkaloid (nic1nic2) mutant. The β-subunit of mitochondrial
ATPase (β-ATPase) served as a control.

Figure 6. Semi-quantitative RT-PCR analysis of PMT gene expression in roots of tobacco plants
before and after topping.

A. Shown is the relative abundance of the individual PMT gene transcripts before and after topping.
RT-PCR was carried out as described in the Materials and Methods using Exon 1-specific primers.
Messenger RNA was amplified from total RNA isolated from the roots of wild-type (HP,
Nic1Nic2) Burley 21 and low alkaloid (LP, nic1nic2) Burley 21 tobacco plants. The far right lane
represents size standards for the genes isolated by PCR amplification from plasmid DNA. The
β-subunit of mitochondrial ATPase (β-ATPase) served as a control.

B. Bar graphs showing relative expression of the individual PMT genes following topping in both HP
and LP tobacco roots. Abbreviations: HP, wild-type (Nic1Nic2) Burley 21; LP, low alkaloid
(nic1nic2) mutant.

Figure 7. The nucleotide and predicted amino acid sequences of the transcribed portions of the N.
tabacum cv. Xanthi NtADC1 and NtADC2 genes. Shown are the complete nucleotide and predicted
amino acid sequences of the N. tabacum cv. Xanthi NtADC1 gene and where they differ from the
NtADC2 gene sequence. The dots indicate nucleotide sequence identity and the stars indicate amino
acid sequence identity. The proposed polyadenylation signal is underlined. The sequences terminate
at the point of polyadenylation found in the full length cDNA (Wang, 1999; AF127239). The
complete nucleotide sequences for the N. tabacum cv. Xanthi NtADC1 (AF127240) and NtADC2
(AF127241) including the 5' and 3' flanking sequences appear in GenBank.

Fig. 8. Comparison of the predicted amino acid sequences of arginine decarboxylases (ADCs) from
various species. Shown is a PILEUP alignment of the predicted amino acid sequence of the N.
tabacum cv. Xanthi NtADC1 gene (AF127240) aligned to the predicted ADC protein sequences from
N. sylvestris (AB12873), Arabidopsis thaliana (AF009647), Avena sativa (oat) (X56802),
Lycopersicon esculentum (tomato) (L16582) and Escherichia coli (M31770). Amino acid residues
conserved among the various ADCs are shaded.

Fig. 9. Gel blot analysis of ADC transcript levels in the roots of wild-type and low alkaloid mutant
Burley 21 tobacco before and after topping. Total RNA was isolated from the roots of mature
wild-type and low alkaloid mutant N. tabacum cv. Burley 21 and analyzed by gel blot analysis using
[α-32P]-dCTP labeled probes recognizing the coding region of ADC or the β-subunit of tobacco
mitochondrial ATP synthase (Boutry and Chua, 1985). Quantitation was carried out by
phosphorimaging using a Molecular Dynamics PhosphorImager. Values were normalized relative to
the intensities of the atp2 control band in each lane. The experiment was conducted twice with
different total RNA samples.

Fig. 10. Differential expression of NtADC-1 and NtADC-2 in various tissues of tobacco. Expression
of the NtADC-1 and NtADC-2 genes was analyzed using semi-quantitative RT-PCR and
gene-specific primers capable of discriminating between transcripts arising from the two genes. Panel A
shows control reactions demonstrating primer specificity in the PCR reactions using plasmids
containing the NtADC-1 and NtADC-2 coding sequences. The numbers above the lanes refer to the
specific primer combinations as described in the Materials and Methods. Panel B shows the results of
RT-PCR reactions using first strand cDNA synthesized from total RNA extracted from either root,
leaf, or flowers. As an internal control, primers specific for the atp2 gene transcript were included in the
amplification reactions. All reactions were carried out within the linear range of template
amplification as determined by varying template amount, amplification time, and temperature as
described in Riechers and Timko (1999).

Fig. 11. Genomic DNA gel blot analysis of the ODC gene family in N. tabacum. Total genomic
DNA (30 µg) was digested with EcoRI or HindIII, fractionated by agarose gel electrophoresis,
transferred to nylon membranes and hybridized with an α-32P-dCTP labeled probe encoding
full-length ODC cDNA as described in the Materials and Methods. The mobilities of molecular weight
standards are given to the right of the figure in kilobases (kb).

Fig. 12. Comparison of the nucleotide and predicted amino acid sequences of the NtODC-1 and
NtODC-2 genes. Shown are the nucleotide and predicted amino acid sequences of the NtODC-1
(AF233850) and NtODC-2 (AF233849) genes. In the figure, the complete amino acid sequence of
pODC2 is given and the pODC1 sequence is given only where it differs. The start site of
transcription is designated as +1 and the poly(A) addition site is indicated by the arrow. Within the
relevant regions of homology, nucleotide differences between the NtODC-1 and NtODC-2 genes are
in bold lettering. The proposed TATA-box and polyadenylation signal are shaded.

Fig. 13. Protein sequence alignment of ornithine decarboxylases (ODCs) from various species.
Shown is a PILEUP alignment of the predicted amino acid sequences of the N. tabacum cv. Xanthi
pODC2 protein (AF233849) with the ODCs from N. tabacum cv. SC58 (Y10472) and cv. BY-2
(AB031066), Lycopersicon esculentum (tomato) (AF030292), Datura stramonium (jimsonweed)
(X87847), Saccharomyces cerevisiae (NP_012737), and humans (Homo sapiens; AAA59966).
Amino acid residues conserved among the various ODCs are shaded.

Fig. 14. Gel blot analysis of ODC transcript levels in various tissues of mature tobacco plants and
in the roots before and after topping. Total RNA was isolated from various tissues of mature N.
tabacum cv. Burley 21 and analyzed by gel blot analysis using α-32P-dCTP labeled coding region
probes for ODC. (A) Transcript levels in various organs of wild-type tobacco: R, root; S, stem; L,
leaf; SE, sepal; PE, petal; O, ovary; S, stamen; and AN, anther. (B) Transcript levels in roots of
Burley 21 tobacco plants before and after topping. RNA gel blot analysis of the tissue-specific
distribution and post-topping expression of transcripts encoding ODC in tobacco. As a control, the
blots were also probed with radioactively labeled probes encoding the alkaloid biosynthesis enzyme
putrescine N-methyltransferase (PMT) and a root-specific β-glucosidase (TBG-1).
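
The normalization described for Fig. 9 (transcript band intensity expressed relative to the atp2 control band in the same lane) amounts to a per-lane ratio. The following Python sketch shows one way such a calculation could be written; the function name, the lane labels, and the dictionary input format are assumptions for illustration, and no measured values from this work are reproduced.

```python
def normalize_to_control(target, control):
    """Divide each lane's target-band intensity by that lane's control-band intensity.

    Both arguments map a lane label to a band intensity in arbitrary units.
    """
    if target.keys() != control.keys():
        raise ValueError("target and control must cover the same lanes")
    normalized = {}
    for lane, value in target.items():
        if control[lane] <= 0:
            raise ValueError(f"control intensity for lane {lane!r} must be positive")
        normalized[lane] = value / control[lane]
    return normalized
```

Because each lane is divided by its own control band, differences in the amount of RNA loaded per lane cancel out of the reported ratio.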


DETAILED DESCRIPTION OF THE INVENTION 
Nucleic acid sequences have been isolated from tobacco that encode important enzymes in
nicotine and total alkaloid formation, including PMT1, PMT2, PMT3, PMT4, ADC1, ADC2, ODC,
and SAMS. Also identified are cDNA fragments encoding partial segments of NADH
dehydrogenase and phosphoribosylanthranilate isomerase. Also identified are promoter regions for
the nucleotides encoding PMT1, PMT2, PMT3, PMT4, and ADC2. All of these nucleic acids are
isolated from Nicotiana tabacum L.

"Promoter" and "promoter region" are terms used interchangeably herein to refer to a DNA 
sequence that regulates expression of a selected DNA sequence operably linked to the promoter, and 
which effects expression of the selected DNA sequence in cells. The term also encompasses the
5' untranslated region that may be transcribed into mRNA but is not translated.

"Protein", "polypeptide", and "peptide" are used interchangeably herein when referring to a 
gene product. 

In one aspect, the invention features isolated nucleic acid molecules encoding PMT1,
PMT2, PMT3, PMT4, ADC1, ADC2, ODC, and SAMS, a fragment of NADH dehydrogenase and a
fragment of phosphoribosylanthranilate isomerase. The disclosed molecules can be non-coding (e.g.,
probe, antisense or ribozyme molecules) or can code for a functional enzyme. In one embodiment,
the nucleic acid molecules can hybridize to the nucleic acid sequences encoding PMT1, PMT2,
PMT3, PMT4, ADC1, ADC2, ODC, SAMS, a fragment of NADH dehydrogenase, or a fragment of
phosphoribosylanthranilate isomerase or to the complements of these nucleic acid sequences. In a
preferred embodiment, the hybridization is conducted under mildly stringent or stringent conditions.
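
The complement of a nucleic acid sequence, as referred to above for antisense molecules and hybridization, can be derived mechanically from the sense strand. The following is a minimal Python sketch; the function name is an assumption and the input in the usage note is an arbitrary placeholder, not one of the sequences of the invention.

```python
# Translation table mapping each DNA base to its Watson-Crick partner.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna):
    """Return the reverse complement of a DNA string containing only A, C, G, T."""
    if set(dna.upper()) - set("ACGT"):
        raise ValueError("sequence contains characters other than A, C, G, T")
    # Complement every base, then reverse so the result reads 5'->3'.
    return dna.translate(_COMPLEMENT)[::-1]
```

For the placeholder input `"ATGGAAGTC"` the function returns `"GACTTCCAT"`, and applying it twice returns the original string.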

In further embodiments, the nucleic acid molecule is at least 50%, 60%, 70%, 80% and more
preferably at least 90% or 95% homologous in sequence to the nucleic acid sequences encoding
PMT1, PMT2, PMT3, PMT4, ADC1, ADC2, ODC, SAMS, a fragment of NADH dehydrogenase, or
a fragment of phosphoribosylanthranilate isomerase or to the complements of these nucleic acid
sequences. In another embodiment, the nucleic acid encodes a polypeptide that is at least 50%, 60%,
70%, 80% and more preferably at least 90% or 95% similar in sequence to the amino acid sequence
of PMT1, PMT2, PMT3, PMT4, ADC1, ADC2, ODC, SAMS, the fragment disclosed herein of
NADH dehydrogenase, or the fragment of phosphoribosylanthranilate isomerase disclosed herein.
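
The percentage thresholds above presuppose some way of scoring two sequences against each other. The text does not specify the scoring method, so the following Python sketch adopts one common convention as an assumption: percent identity over the columns of a pre-computed pairwise alignment, ignoring columns in which both sequences carry a gap. The function name and the gap character are likewise assumptions.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of alignment columns in which both sequences carry the same residue.

    Columns where both sequences have a gap are excluded from the denominator;
    columns where only one sequence has a gap count as mismatches.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if not (a == gap and b == gap)]
    if not columns:
        raise ValueError("no aligned columns to compare")
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)
```

With the placeholder alignment `"ATGC-A"` against `"ATGGTA"`, four of six columns match, so the function returns about 66.7. Other conventions, such as dividing by the length of the shorter sequence or scoring conservative amino acid substitutions as similar, would give different values for the same pair.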

In another embodiment, the invention features isolated polypeptides, preferably substantially
pure preparations, encoded by the nucleic acid sequences of the invention. Particularly preferred
are those polypeptides encoded by the nucleic acid sequences identified by (SEQ. ID. NO. 2),
(SEQ. ID. NO. 5), (SEQ. ID. NO. 8), (SEQ. ID. NO. 11), (SEQ. ID. NO. 13), (SEQ. ID. NO. 15),
(SEQ. ID. NO. 18), (SEQ. ID. NO. 21), (SEQ. ID. NO. 23), (SEQ. ID. NO. 25) or (SEQ. ID. NO. 26)
or comprising a nucleotide sequence encoding the amino acid sequence encoded by (SEQ. ID. NO. 3),
(SEQ. ID. NO. 6), (SEQ. ID. NO. 9), (SEQ. ID. NO. 12), (SEQ. ID. NO. 14), (SEQ. ID. NO. 16),
(SEQ. ID. NO. 19), (SEQ. ID. NO. 22) or (SEQ. ID. NO. 24). In particularly preferred
embodiments, the subject polypeptides can aid in regulating the production of alkaloids, particularly
nicotine, in plants. In one embodiment, the polypeptide is identical to or similar to the protein
represented by the amino acid sequences of (SEQ. ID. NO. 3), (SEQ. ID. NO. 6), (SEQ. ID. NO. 9),
(SEQ. ID. NO. 12), (SEQ. ID. NO. 14), (SEQ. ID. NO. 16), (SEQ. ID. NO. 19), (SEQ. ID. NO. 22)
or (SEQ. ID. NO. 24). In a preferred embodiment, the polypeptide is encoded by a nucleic acid that
hybridizes with a nucleic acid represented in.

The polypeptides of the present invention can comprise full length proteins, such as represented
by (SEQ ID NO. 3), (SEQ. ID. NO. 6), (SEQ ID. NO. 9), (SEQ. ID. NO. 12), (SEQ. ID. NO. 14),
(SEQ. ID. NO. 16), (SEQ. ID. NO. 19), (SEQ. ID. NO. 22) and (SEQ. ID. NO. 24), or can comprise
one or more fragments corresponding to one or more particular motifs/domains, or to arbitrary sizes,
e.g., at least 5, 10, 25, 50, 100, 150, or 200 amino acids in length.

Another aspect of the invention features chimeric genes comprised of a promoter for the genes 
for PMT2, PMT1, PMT3, PMT4, or ADC2. Yet another aspect of the invention features chimeric 
genes or chimeric molecules comprised respectively of the functional gene encoding for or the 
protein PMT1, PMT2, PMT3, PMT4, ADC1, ADC2, ODC, SAMS, NADH dehydrogenase and/or 
phosphoribosylanthranilate isomerase.

The invention also concerns isolated and purified promoter regions for tobacco beta-glucosidase
and their use in chimeric genes or chimeric molecules.

Another aspect of the invention involves vectors capable of transporting another nucleic acid to 
which a vector has been linked. Preferably, the vectors comprise the nucleic acid sequences of the 
invention or their complements. 

The invention also features transgenic plants and their seeds that include (and preferably
express) a heterologous form of PMT1, PMT2, PMT3, PMT4, ADC1, ADC2, ODC, SAMS, NADH
dehydrogenase and/or phosphoribosylanthranilate isomerase. The present invention also
encompasses transgenic plants that contain in their genome a chimeric gene construction
incorporating the nucleic acid encoding PMT1, PMT2, PMT3, PMT4, ADC1, ADC2, ODC, SAMS,
NADH dehydrogenase and/or phosphoribosylanthranilate isomerase. Such transgenic plants and their
seeds may be useful to natively produce enhanced quantities of desirable exogenous proteins, such as
compounds useful for pharmaceutical purposes, or proteins that provide herbicide resistance.

Another feature of the invention is the use as probes of the DNA sequences disclosed herein or
oligonucleotide fragments thereof. Probes may be useful to obtain additional gene family members
or locate homologous genes in tobacco or other plant species. Copies of related genes can be
obtained from existing genomic libraries or the genomic libraries can be constructed. In one
embodiment, an isolated DNA sequence comprising about a fifteen to about a twenty-five base pair
oligonucleotide sequence identical to any consecutive about fifteen to about twenty-five base pair
sequence found in the sequences of the invention is used as a probe.
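
The probe embodiment above can be pictured as a sliding window over a disclosed sequence. The sketch below is only an illustration under stated assumptions: the function names are invented here, the toy sequence is hypothetical rather than taken from the Sequence Listing, and the word "about" in the length limits is modeled as hard bounds of 15 and 25 bases.

```python
def probe_windows(sequence: str, min_len: int = 15, max_len: int = 25):
    """Yield (start, probe) for every contiguous window of min_len..max_len bases."""
    seq = sequence.upper()
    for length in range(min_len, max_len + 1):
        for start in range(len(seq) - length + 1):
            yield start, seq[start:start + length]


def is_candidate_probe(oligo: str, sequence: str,
                       min_len: int = 15, max_len: int = 25) -> bool:
    """True if the oligo has an allowed length and occurs identically in the sequence."""
    oligo = oligo.upper()
    return min_len <= len(oligo) <= max_len and oligo in sequence.upper()


# Hypothetical 30-nucleotide toy sequence used only to exercise the functions.
toy_sequence = "ATGGAAGTCATATCTACCAACACAAATGGC"
candidates = list(probe_windows(toy_sequence))
print(len(candidates), "candidate probes in the toy sequence")
```

In practice a probe would also be screened for melting temperature, self-complementarity, and specificity, none of which this sketch attempts.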

Another feature is use of the polypeptides of the invention in an assay, such as an assay to 
identify modulators of enzyme activity in plants. 

Other features and advantages of the invention will be apparent to those of skill in the art. 

The nucleotide and amino acid sequences of the invention are disclosed herein in the Sequence
Listing, text, and the figures. Specific sequences of the invention are provided in the attached
Sequence Listing and can be understood to represent promoters, nucleic acids, and proteins
respectively relating to the following proteins: PMT2 (SEQ. ID. NOS. 1, 2, and 3); PMT1 (SEQ. ID.
NOS. 4, 5, and 6); PMT3 (SEQ. ID. NOS. 7, 8, and 9); PMT4 (SEQ. ID. NOS. 10, 11, and 12);
SAMS (SEQ. ID. NOS. 13 and 14); ODC (SEQ. ID. NOS. 15 and 16); ADC1 (SEQ. ID. NOS. 17,
18, and 19); ADC2 (SEQ. ID. NOS. 20, 21, and 22); ADC1 mRNA (SEQ. ID. NOS. 23 and 24);
NADH dehydrogenase (SEQ. ID. NO. 25); and PAI (SEQ. ID. NO. 26). If only two sequence
identifiers are provided for a protein, those sequences represent the nucleic acid and the protein
respectively. If three identifiers are provided, the identifiers represent promoter, genomic or cDNA
nucleic acid, and peptide sequences, respectively. If only one identifier is provided, it represents a
DNA fragment coding for the protein or portions of it.
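
The identifier convention just described can be restated as a small lookup table. The Python sketch below is an illustrative restatement only: the dictionary and function names are assumptions, while the SEQ ID numbers and the one-, two-, and three-identifier rule are taken from the paragraph above.

```python
# SEQ ID numbers as listed in the text above, keyed by the name used there.
SEQ_IDS = {
    "PMT2": (1, 2, 3),
    "PMT1": (4, 5, 6),
    "PMT3": (7, 8, 9),
    "PMT4": (10, 11, 12),
    "SAMS": (13, 14),
    "ODC": (15, 16),
    "ADC1": (17, 18, 19),
    "ADC2": (20, 21, 22),
    "ADC1 mRNA": (23, 24),
    "NADH dehydrogenase": (25,),
    "PAI": (26,),
}

# Meaning of each identifier, chosen by how many identifiers an entry has.
ROLES_BY_COUNT = {
    3: ("promoter", "genomic or cDNA nucleic acid", "peptide"),
    2: ("nucleic acid", "protein"),
    1: ("DNA fragment",),
}


def describe(name: str) -> dict:
    """Map each role to its SEQ ID number for the named entry."""
    ids = SEQ_IDS[name]
    return dict(zip(ROLES_BY_COUNT[len(ids)], ids))


print(describe("PMT1"))
print(describe("ODC"))
```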

For other reference, the sequences may be found at the following records in GenBank at the 
following Accession Numbers, which records are hereby incorporated in their entirety herein: 
AF126810 (NtPMT1); AF126809 (NtPMT2); AF126811 (NtPMT3); AF126812 (NtPMT4);
AF176908 (NtomPMT) (Nicotiana tomentosiformis); AF76909 (NotoPMT) (Nicotiana otophora);
AF127239 (ADC); AF127240 (ADC1); AF127241 (ADC2); AF127242 (ODC); AF233849 (ODC2);
AF233850 (ODC1); and AF127243 (SAMS).

The following experimental discussion is presented to better illustrate the invention. 
I. PMT 

The present invention features the characterization of four members of the nuclear gene family 
encoding PMT in tobacco N. tabacum. The nucleic acid sequences encoding PMT and the amino 
acid sequences for the proteins are reported herein and can also be found in the DDBJ, EMBL, and 
GenBank Nucleotide Sequence Databases under the accession numbers for NtPMT1a, NtPMT2,
NtPMT3, and NtPMT4 at AF126810, AF126809, AF126811, and AF126812, respectively. The
complete coding region and immediate 5'- and 3'-flanking regions are characterized.

As the discussion below shows, all four PMT genes present in the N. tabacum genome are 
expressed in the roots of wild-type plants and differentially regulated in tobacco lines expressing 
either high or low total alkaloid contents. 

Materials and Methods 

Plant materials 

Seeds of N. sylvestris, N. otophora, and N. tomentosiformis were obtained from the USDA-ARS 
national tobacco germplasm collection (Oxford, NC). N. tabacum cv. Burley 21 and N. tabacum cv. 
Xanthi seeds were kindly provided by Glenn Collins, University of Kentucky. Tobacco plants used 
for DNA isolation were grown in a soil:vermiculite mixture in the greenhouse under natural lighting
conditions. Plants used for RNA extraction were grown in Moltan Plus (Moltan Co., Middleton, 
TN). 

Gel blot analysis of genomic DNA 

Young leaves were collected from greenhouse-grown tobacco (N. tabacum cv. Xanthi) plants and
total genomic DNA was isolated from freshly harvested tissues using a modification of the CTAB
extraction method (Dellaporta et al., 1983). Approximately 30 ng of total DNA was digested with
EcoRI, KpnI, or EcoRI and KpnI and the digestion products separated by electrophoresis through a
0.75% agarose gel. Restricted and size-fractionated DNA was denatured and transferred to Nytran+
nylon membranes (Schleicher and Schuell, Keene, NH) by capillary blotting in 0.4 N NaOH
overnight. Membranes were prehybridized in 0.25 M Na2HPO4 (pH 7.4), 7% SDS, 1 mM Na2EDTA
for at least 2 hr, then hybridized overnight at 65°C in the same buffer with 2-3 x 10^6 cpm/mL of a
32P-labeled single-stranded probe (antisense DNA strand). The probe was prepared by the method of
Bednarczuk et al. (1991) using a primer (Table 1, primer 4) designed from the 3' end of the
NtPMT1a coding region (Exon 8) and the full-length coding region of the NtPMT1a cDNA as
template. The NtPMT1a cDNA was generated by RT-PCR using synthetic oligonucleotide primers
based on the N- and C-terminal sequences of the A411 cDNA reported by Hibi et al. (1994) and
RNA template isolated from N. tabacum cv. Burley 21 roots. Membranes were washed at a final
stringency of 0.1 x SSC, 0.1% SDS at 65°C. Hybridizing bands were visualized by autoradiography
and/or imaged using a Molecular Dynamics PhosphorImager (Model 445 SI, Sunnyvale, CA).

Genomic library construction and phage isolation 

A library of N. tabacum cv. Xanthi genomic DNA fragments constructed in EMBL3 was purchased
from Clontech (Palo Alto, CA) and a total of 1.1 x 10^6 recombinant phage were screened by plaque
hybridization using random-primed 32P-labeled NtPMT1a cDNA as probe (Sambrook et al., 1989).
Prehybridization, hybridization, and washing conditions were as described above. Positive 
hybridizing phage were plaque purified by subsequent rounds of rescreening and DNA was prepared 
from 18 independently isolated phage. The phage DNA was characterized by restriction analysis and 
DNA gel blot analysis and fragments containing the sequences encoding PMT were subcloned into 
pBluescript KS vectors for further analysis. 

Comparison of the hybridizing fragments present in the 18 recombinant phage to the
hybridization pattern obtained by genomic DNA blot analysis indicated that only three of the PMT
genes suspected to be present in the N. tabacum genome were recovered from the library screen. To
obtain sequences encoding NtPMT1a, a subgenomic library was constructed from N. tabacum cv.
Xanthi DNA. The library consisted of gel-purified 2.5-3.5 kb EcoRI fragments ligated into λZAP
II vector arms and packaged using Gigapack III Gold packaging extracts according to the
manufacturer's instructions (Stratagene, La Jolla, CA). The primary library was amplified once in
E. coli XL1-Blue MRF' strain (Stratagene) and screened as described above, except that a
random-primed 32P-labeled NtPMT1a cDNA Exon 1-specific probe was used (Table 1). Exon 1 had
previously been amplified by PCR using primers 1 and 2 (Table 1) and the NtPMT1a cDNA as
template. The recombinant phage that hybridized with the probe was isolated from the sublibrary by
two more rounds of plaque purification, and the pBluescript phagemid containing the approximately
3.1 kb EcoRI genomic fragment with the NtPMT1a gene was excised from the λZAP II phage
vector using the in vivo excision protocol described by Stratagene.

DNA sequence analysis

Unless otherwise noted, DNA sequencing was performed with double-stranded plasmid DNA
templates using fluorescent dye terminator technology (dRhodamine Terminator Cycle Sequencing
Ready Reaction kit) on an ABI 310 DNA sequencer (Perkin-Elmer Applied Biosystems). For
analysis of PCR products, following electrophoretic separation of amplification reaction products,
the bands of interest were excised from the polyacrylamide gels, the DNA extracted using the
Qiagen Gel Extraction Kit, and the recovered DNA used as sequencing template. Sequencing was
performed using AmpliTaq DNA polymerase and fluorescent dye terminator technology (as
described above) and primers 1 and 2 (Table 1) specific for Exon 1. Nucleotide and amino acid
sequences were analyzed and aligned using either the Clustal method and Lasergene software
(DNAStar Inc., Madison, WI) or the PILEUP and ALSCRIPT (Genetics Computer Group, Madison,
WI) sequence analysis package (Version 9.0). Transcription factor binding site homologies were
identified in promoter DNA sequences by searching the transcription factor database using the GCG
program.

RNA gel blot analysis 

For RNA analysis, roots and other tissues were harvested from mature wild-type (HP; Nic1Nic2) and
low alkaloid mutant (LP; nic1nic2) Burley 21 tobacco plants. For topping experiments, the stem was
cut and the top one-third of the plant was removed just prior to flower opening. Roots were
harvested just prior to topping (0 hr control) and at various times after decapitation. The tissue was
immediately frozen in liquid nitrogen and stored at -80°C until RNA extraction and isolation.

Total RNA was isolated from vegetative organs and floral structures of HP and LP Burley 21
tobacco using the TRI-reagent (Molecular Research Center Inc., Cincinnati, OH) and quantified
spectrophotometrically by measuring A260. Total RNA (5 ng) was electrophoresed through 1.2%
agarose gels (containing 0.4 M formaldehyde) and transferred to Nytran+ nylon membranes.
Following prehybridization the membranes were hybridized with a single-stranded NtPMT1a cDNA
antisense probe (corresponding to the antisense strand of Exons 2 to 8 of the NtPMT1a cDNA coding
region) as described above. As a control to quantify and normalize RNA levels in each lane, the blot
was hybridized with a 400-bp probe derived from the β-ATPase cDNA using primers 6 and 7 (Table
1) as described below.

Semi-quantitative RT-PCR analysis of individual PMT transcript levels

Total RNA (1 µg) extracted from the roots of HP and LP Burley 21 tobacco plants was
reverse-transcribed into first-strand cDNA at 42°C using Superscript II reverse transcriptase (Gibco BRL)
according to the manufacturer's protocol. Two gene-specific primers were employed in the reactions:
primer 5 capable of recognizing Exon 3 of the PMT genes and primer 8 specific for Exon 8 of the
nuclear gene encoding the β-subunit of mitochondrial ATPase from N. plumbaginifolia (NpATP2.1)
and N. sylvestris (NsATP2.1) (Boutry and Chua, 1985; Lalanne et al., 1998). The β-ATPase
transcript served as an internal reference (constitutively-expressed control) to determine loading
accuracy and to normalize expression levels (Kinoshita et al., 1992). Following first strand cDNA
synthesis, two sets of nested primers (0.4 each primer) were used to amplify the PMT and
β-ATPase transcripts: primers 1 and 2 (Table 1) recognized Exon 1 in all five PMT transcripts and
gave products ranging in size from 220 bp to 420 bp and primers 6 and 7 amplified an approximately
400-bp region encompassing a portion of Exons 6 to 8 of the β-ATPase coding region.

Amplification was carried out for 25 cycles using the following reaction conditions: denaturation at
95°C for 1 min, primer annealing at 60°C for 35 sec, and extension at 72°C for 1.5 min; a final
extension was conducted at 72°C for 6 min. Amplification products were radioactively labeled by
spiking the PCR reaction with 10 µCi 32P-dCTP. Aliquots of the PCR reaction were analyzed on a
6.5% non-denaturing polyacrylamide/1X TBE gel and electrophoresed at 600 volts. The reaction
conditions were optimized to provide amplification of both PMT and β-ATPase transcripts in the
linear range of the reaction by varying the levels of first strand cDNA template, annealing
temperature, and number of cycles of amplification as described in Kinoshita et al. (1992).
Molecular weight standards were prepared by PCR amplification using the same primers and
protocol described above and plasmid DNA templates containing the PMT encoding genomic
fragments, as well as genomic DNA from the various Nicotiana species indicated in the text.

Following electrophoresis, the polyacrylamide gels were fixed in 5% MeOH, 7.5% acetic acid
for 30 min, dried overnight, and used to expose X-ray film. PMT band intensities were quantified
using phosphorimager analysis (Molecular Dynamics) and normalized relative to the intensities of
the β-ATPase control band in each lane. The experiment was conducted twice with different total
RNA samples, and representative results are presented from one of the two experiments.
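
The normalization step described above amounts to dividing each PMT band intensity by the control band intensity from the same lane, then comparing lanes. The following Python sketch shows that arithmetic under stated assumptions: the function names are invented for illustration and every intensity value is hypothetical, not a measurement from these experiments.

```python
def normalize_to_control(pmt_bands: dict, control_band: float) -> dict:
    """Divide each PMT band intensity by the control band intensity of the same lane."""
    if control_band <= 0:
        raise ValueError("control band intensity must be positive")
    return {name: value / control_band for name, value in pmt_bands.items()}


def fold_change(normalized_later: dict, normalized_start: dict) -> dict:
    """Ratio of normalized intensity at a later time point to the 0 hr value."""
    return {name: normalized_later[name] / normalized_start[name]
            for name in normalized_start}


# Hypothetical phosphorimager counts for two lanes (0 hr and 24 hr after topping).
lane_0hr = normalize_to_control({"NtPMT2": 200.0, "NtPMT4": 100.0}, control_band=1000.0)
lane_24hr = normalize_to_control({"NtPMT2": 720.0, "NtPMT4": 132.0}, control_band=1200.0)
changes = fold_change(lane_24hr, lane_0hr)
for gene, ratio in changes.items():
    print(gene, round(ratio, 2))
```

Dividing by the control band first means that a lane loaded with more total RNA does not register as an apparent increase in PMT expression.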

Results 

PMT gene structure and organization in N. tabacum 

Gel blot analysis of total genomic DNA isolated from N. tabacum cv. Xanthi, hybridized with a
radioactively-labeled cDNA (NtPMT1a) encoding the complete coding region of putrescine
N-methyltransferase (PMT) showed the presence of five major hybridizing bands in KpnI- or
EcoRI-digested DNA, consistent with the presence of a small multigene family in the N. tabacum genome
(Figure 1).

As part of our initial characterization of the gene family encoding PMT in N. tabacum, an
EMBL3 genomic library, prepared from N. tabacum cv. Xanthi DNA, was screened using the
NtPMT1a (A411 homologous) cDNA as probe. From a total of 18 recombinant phage isolated, three
phage were recovered that contained genomic fragments encoding the NtPMT2, NtPMT3 and
NtPMT4 genes. The three PMT genes were completely encoded within a unique sized EcoRI
fragment within the phage DNA insert which allowed for the correlation of each with a hybridizing
restriction fragment on the gel blot of N. tabacum genomic DNA (Figure 1). The complete coding
region and immediate 5' and 3' non-coding sequences of the three genes were determined and found
to encode full-length PMT proteins (Figure 2). Each PMT gene consisted of 8 exons and 7 introns,
consistent with the gene structure reported previously for the PMT genes from N. sylvestris
(Hashimoto et al., 1998a). Comparison of the deduced amino acid sequences (Figure 2) revealed
that the encoded PMT proteins were extremely similar over their entire length, with the only
significant variability in primary sequence localized to the extreme N-terminal region of the protein.
This region, completely encoded within Exon 1, contains a variable number of an 11 amino acid
repeat with a consensus sequence of NGHQNGTSEHQ. The function of the repeated sequence is
unknown, but is apparently inconsequential to enzyme function, since the number of repeats does not
influence activity and PMTs characterized from other species do not contain the repeated element
(Hashimoto et al., 1998a; Suzuki et al., 1999a).
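
To make the notion of a variable number of 11 amino acid repeats concrete, the sketch below scans a protein N-terminus for non-overlapping windows that resemble the consensus NGHQNGTSEHQ. It is an illustration under stated assumptions: the mismatch tolerance of two residues, the greedy scanning strategy, and the toy sequence are all hypothetical choices, and the toy sequence is not one of the PMT sequences of the Sequence Listing.

```python
CONSENSUS = "NGHQNGTSEHQ"  # 11 amino acid consensus given in the text


def hamming(a: str, b: str) -> int:
    """Number of positions at which two equal-length strings differ."""
    return sum(x != y for x, y in zip(a, b))


def find_repeats(protein: str, consensus: str = CONSENSUS, max_mismatches: int = 2):
    """Greedy left-to-right scan for non-overlapping near-consensus windows."""
    hits = []
    n = len(consensus)
    i = 0
    while i + n <= len(protein):
        window = protein[i:i + n]
        if hamming(window, consensus) <= max_mismatches:
            hits.append((i, window))
            i += n  # jump past the accepted repeat so hits never overlap
        else:
            i += 1
    return hits


# Hypothetical N-terminal sequence containing three near-consensus repeats.
toy_n_terminus = ("MEVISTNT" + "NGHQNGTSEHQ" + "NGHQNGTFEHR"
                  + "NGHHNGTSEHQ" + "NGSAATTK")
repeats = find_repeats(toy_n_terminus)
print(len(repeats), "repeats found at positions", [pos for pos, _ in repeats])
```

Because the repeat count changes the length of Exon 1, a count of this kind is one way to relate a deduced protein sequence to the size differences exploited in the PCR assays described in this section.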

Multiple rounds of screening of the EMBL3 genomic library failed to yield additional
hybridizing phage containing sequences encoding the other two PMT genes thought to be present in
the N. tabacum genome and, therefore, a directed cloning approach was pursued using a subgenomic
library constructed from EcoRI fragments isolated from N. tabacum cv. Xanthi. From this
hybridization screening, a phage containing the approximately 3.1 kb EcoRI fragment encoding
NtPMT1a was recovered. The coding region of the NtPMT1a gene was found to be identical to the
A411 cDNA (Hibi et al., 1994), with the exception of a single base change in Exon 6 that results in a
conservative amino acid substitution. This difference could be the result of minor differences among
cultivars used in the two studies (i.e., Xanthi vs. Burley 21). Translation of the open reading frame
contained in NtPMT1a showed that it encoded a protein containing four N-terminal 11 amino acid
repeats, similar to Exon 1 of the PMT gene present in N. tomentosiformis (Hashimoto et al., 1998a).

Given the observation that NtPMT1a encoded a homolog of the PMT gene present in N.
tomentosiformis, the nature and possible evolutionary origin of the remaining PMT gene present in
the N. tabacum genome was brought into question. From our expression studies (described in detail
below), we had determined that five distinct PMT encoding transcripts were present in the roots of
N. tabacum, four of which could be accounted for based upon the length of the Exon 1 coding region in
the four PMT genes isolated and characterized in our studies described above. The fifth transcript
was similar in size to that encoded by NtPMT1a and, therefore, was designated NtPMT1b. Since the
variability in PMT gene structure is primarily localized within Exon 1, we used a PCR-based strategy
to analyze the PMT gene structure and family size in N. otophora, the other possible progenitor of
N. tabacum. As shown in Figure 3, five distinct PCR products were detected in the electrophoretic
pattern of amplification products generated from N. tabacum genomic DNA using Exon 1 specific
primers (Table 1). Consistent with our studies described above and the previous work of Hashimoto
et al. (1998a), three PCR products were detected in the electrophoretic pattern of amplification
products generated from N. sylvestris genomic DNA, and a single band was recovered from N.
tomentosiformis genomic DNA. Amplification of genomic DNA from N. otophora using Exon 1
specific primers also yielded only a single band, whose electrophoretic mobility was most similar to
that of the NtPMT1b derived product.

Analysis of PMT gene intron and flanking sequences 

The location of the seven introns within the protein coding region of the five PMT genes in N.
tabacum is identical and appears to be conserved among PMT genes from different Nicotiana
species. There is also little variation in the nucleotide sequences that comprise the Exon-Intron
splice junctions in the various PMT genes in N. tabacum (Table 2). The high degree of nucleotide
sequence similarity recognized among PMT genes within their coding regions is also present within
their introns and immediate 5' and 3' flanking sequences (Table 2 and Figure 4). In general, a greater
level of sequence identity is found in the introns of the NtPMT1, NtPMT3, and NtPMT4 genes, than
in pair-wise comparisons among the introns of the other members of the N. tabacum PMT gene
family. The observed conservation in the intron sequences of the NtPMT2, NtPMT3, and NtPMT4
genes is consistent with their origin from the same progenitor species (N. sylvestris). One interesting
exception occurs within Intron 6, where the length of the intron and the sequence similarity is more
conserved between NtPMT1a and NtPMT4, than between NtPMT4 and NtPMT2 or NtPMT3.

Approximately 1 kb of nucleotide sequence was determined 5' to the coding regions of the
NtPMT1a, NtPMT2, NtPMT3, and NtPMT4 genes (Figure 4). By comparison to the 5'-untranslated
region (UTR) present in the A411 cDNA, we set the start site for transcription initiation at
approximately 57 nucleotides upstream of the MET start codon in NtPMT1a and NtPMT3, and either
69 or 60 nucleotides upstream in NtPMT2 and NtPMT4. The major distinguishing feature between
the 5'-UTRs in the various genes is the presence or absence of a 17 bp sequence in the gene. An
appropriately placed TATA-box can be easily recognized 45 bp 5' to the initiation site in all four
genes. Within the first 200-250 bp upstream of the TATA box, a high level of sequence conservation
is found to exist among the promoter regions in the four genes. After this point, a clear difference
can be observed between the NtPMT1a promoter and the remaining three genes, and by 400 bp
upstream, little similarity can be found among any of the gene family members.

Analyzing the proximal regions of the various PMT promoters with various motif scanning
software identified several G-box-like sequences (Foster et al., 1994; Kim et al., 1992; Menkens et
al., 1995; Staiger et al., 1989; Williams et al., 1992) at various positions among the PMT promoters,
and a potential metal response element (MRE) (positions -75 to -66; numbering relative to the
NtPMT1a promoter sequence) in three of the four PMTs (Cizewski-Culotta and Hamer, 1989; Thiele,
1992). An unusual 17 nucleotide stretch of guanine occurs at positions -259 to -243 in the NtPMT1a
gene promoter followed upstream by a purine-rich region (positions -332 to -263). In the NtPMT3
promoter a 14 bp palindromic sequence (positions -497 to -484) was detected. PMT gene expression
has been reported to increase in root tissues following treatment with methyl jasmonate (Imanishi et
al., 1998). However, none of the sequence motifs reported to confer methyl jasmonate-responsiveness
in other plant genes (Mason et al., 1993; Rouster et al., 1997) were detected in the
PMT promoters.

Comparison of the available nucleotide sequence information from the 3'-flanking regions of
the various PMT genes in N. tabacum revealed that the 3'-UTRs in the NtPMT2, NtPMT3, and
NtPMT4 genes of N. tabacum share approximately 81-94% identity with each other and are
essentially identical to those reported for N. sylvestris PMTs by Hashimoto et al. (1998a). The major
distinguishing feature among the various genes is the presence of two short (20 bp and 4 bp)
deletions in the NtPMT2 gene, which lowers the percent identity. The 3'-UTR of NtPMT1a is
identical to that reported for the A411 cDNA (Hibi et al., 1994) and 81-94% identical to the other
PMT genes in the N. tabacum genome. Unfortunately, no sequence information is currently
available for the 3'-UTR of the N. otophora or N. tomentosiformis PMT genes.

Regulation of PMT gene expression 

To determine whether the members of the PMT gene family in N. tabacum were differentially
expressed, a series of experiments was carried out to define the temporal and spatial distribution of
transcripts arising from the five genes. Shown in Figure 5A are the results of gel blot analysis of
total RNA extracted from various tissues of mature Burley 21 tobacco plants hybridized with
radioactively-labeled probe capable of detecting all five PMT transcripts. Consistent with previous
studies (Hashimoto et al., 1998b; Hibi et al., 1994), PMT expression is localized exclusively to roots.
When maturing wild-type (HP) Burley 21 plants are topped (i.e., the floral meristem and upper 1/3 of
the stem are removed), a dramatic increase in PMT transcript abundance is observed within 2 hr,
reaching a maximal level of accumulation by 12-24 hr. Transcripts of two sizes are detected on the gel
blots, reflecting the small difference in message size that occurs as a result of the difference in size of
Exon 1 among the genes.

In addition to examining PMT gene expression in wild-type plants, we also examined
expression in a low nicotine-producing (LP) mutant of Burley 21 (Legg and Collins, 1971). The low
nicotine Burley 21 line harbors mutations at two independent loci (nic1 and nic2) thought to be
global regulators of gene expression involved in alkaloid formation. As shown in Figure 6B, topping
of the low nicotine mutant (nic1nic2) Burley 21 did not cause an increase in PMT transcript
abundance as observed in wild type plants. Thus, it appears that Nic1 and Nic2 are likely involved in
regulation of PMT expression at the very least, and may also be involved in the regulation of other
genes in the alkaloid biosynthetic pathway. Whether this is a direct effect (e.g., transcriptional
activation) or indirect remains to be determined.

In order to determine the extent to which the individual members of the gene family
contributed to the general pattern of expression described above, a semi-quantitative RT-PCR
strategy (Kinoshita et al., 1992) was used to detect and quantify the levels of the individual PMT
transcripts in the roots of both wild-type (HP) and low alkaloid (LP) Burley 21 tobacco. This
approach takes advantage of the fact that Exon 1 is variable in length among the various PMT genes
(Figure 2), allowing for their individual detection and quantitation following polyacrylamide gel
electrophoresis and autoradiography.

Five RT-PCR products (representing Exon 1 from each of the five genes present in N. tabacum)
were detected in the electrophoretic profiles of amplification products derived from reactions using
either HP or LP Burley 21 root RNA (Figure 6A). All five PMT genes present in the N. tabacum
genome were expressed in the roots of wild-type plants, and topping resulted in a differential
accumulation of transcripts derived from each gene. Among the five genes, transcripts derived from
the NtPMT2 and NtPMT1b showed the greatest increase in abundance, rising approximately 3-fold
during the first 24 hr post-topping, whereas levels of the NtPMT1a and NtPMT4 transcripts changed
little in response to topping (Figure 6B). In the LP mutant, little or no effect was observed on the
levels of the various PMT transcripts following topping, although in some cases (e.g., NtPMT1a) a
small but likely insignificant depression in transcript abundance was detected. Thus, it appears that
all five genes contribute to PMT activity levels within the root.

II. ADC

The present invention features the characterization of two members of the nuclear gene family
encoding ADC in tobacco N. tabacum L. As the following discussion shows, ADC2 is preferentially
expressed in roots and accounts for the major portion of ADC transcripts present. Furthermore,
analysis of ADC transcript levels in roots of low and high nicotine producing lines showed that ADC
expression is under the control of the Nic1 Nic2 regulatory loci.

Materials and methods 

Plant growth and tissue preparation 

Seeds of N. tabacum cv. Xanthi, wild-type and low alkaloid nic1 nic2 mutant N. tabacum cv. Burley
21 were obtained from Dr. G. Collins (University of Kentucky, Lexington). Tobacco plants used for
DNA isolation were grown in soil:vermiculite mixture in the greenhouse under natural lighting
conditions. Plants used for RNA extraction were grown either in Moltan Plus (Moltan Co.,
Middleton, TN) or hydroponically in a dilute (half-strength) Peters nutrient solution with continuous
aeration of the roots under natural lighting conditions in the greenhouse. Topping experiments were
conducted by removing the floral meristem, leaves and stem (approximately the upper 1/3 of the
plant) from tobacco plants just prior to blooming. Plant tissues were collected from fully matured
individuals, frozen in liquid nitrogen, and stored at -80 °C until used for RNA preparation (see
below).

Screening of genomic libraries and phage characterization 

A genomic library constructed in λ EMBL3 from N. tabacum cv. Xanthi leaf DNA (Clontech, Inc.,
Palo Alto, CA) was screened by plaque hybridization (Sambrook et al., 1989) using an
[α-32P]-dCTP-labeled, 2.7 kb EcoRI-XhoI fragment from plasmid PR24 as probe. PR24 encodes a full length
ADC cDNA isolated from the roots of wild-type N. tabacum cv. Burley 21 (Wang, 1999).
Hybridization was performed at 65 °C for 16 h in a solution containing 0.25 M Na2HPO4 (pH 7.2)
and 7% (w/v) SDS. Following hybridization, the membranes were washed twice in 2 x SSC, 0.1%
SDS for 15 min at room temperature, and once in 0.2 x SSC, 0.1% SDS for 30 min at 65°C. Hybridizing
phage were picked and plaque purified through three subsequent rounds of hybridization screening.
Phage DNA was isolated from plaque purified phage using a Qiagen Phage Midi Preparation Kit
(Qiagen, Germany) and insert DNA characterized by restriction mapping and DNA gel blot analysis.
The relevant hybridizing bands in each phage were cloned into pBluescript SK+ vectors for further
analysis.

Nucleic acid sequencing and analysis 

Nucleotide sequencing was carried out manually using the Sequenase Version 2.0 protocols
according to the manufacturer's protocol (United States Biochemical, Cleveland, OH) or with an ABI
310 Genetic Analyzer (PE Applied Biosystems, Foster City, CA) using double-stranded plasmid
DNA templates prepared utilizing the Qiaprep Spin Plasmid Kit (Qiagen USA, Valencia, CA). The
nucleotide and predicted amino acid sequences of the various cDNAs were analyzed using BLAST
sequence analysis programs (Altschul et al., 1990; Gish and States, 1993) and protein sequence
alignments were carried out using the PILEUP program (Genetics Computer Group Sequence
Analysis package, Version 9.0; GCG, University of Wisconsin, Madison, WI) and the various gene
sequences available in the NCBI (National Center for Biotechnology Information, Bethesda, MD)
nucleotide and protein sequence database. Manual adjustment of the sequence alignments was
carried out as necessary.

RNA isolation and gel blot analysis 

Total RNA was extracted from tobacco roots, leaves, and floral parts using Tri-Reagent
(Molecular Research Center, USA, Cincinnati, OH) according to the manufacturer's protocol. For
RNA gel blot analysis, aliquots (10 ng) of total RNA extracted from the various tissues were
fractionated by electrophoresis through a 1.2% agarose-formaldehyde gel and blotted onto Nytran
nylon membranes (Schleicher & Schuell, Keene, NH) using 10 X SSC. The transferred RNA was
UV cross-linked to the membrane using a UV Stratalinker (Stratagene, La Jolla, CA) and the
membranes were prehybridized in 7% SDS, 0.25 M Na2HPO4, pH 7.2 for 2-4 hours at 65 °C.
Hybridization was carried out in the same buffer in the presence of 32P-labeled probes for 16 hr at
65 °C. The membranes were washed under high stringency conditions and subjected to
autoradiography at -80 °C for approximately 48 h.

For gel blot analysis, [α-32P]-dCTP-labeled probes were prepared by random primed labeling
(Random Primed Labeling Kit, Boehringer Mannheim, Indianapolis, IN) using 25-50 ng of a 2.7 kb
EcoRI-XhoI fragment derived from PR24 and a 460 bp fragment amplified from the β-subunit of the
tobacco mitochondrial ATP synthase gene (atp2) (Boutry and Chua, 1985).

Semi-quantitative RT-PCR analysis of NtADC1 and NtADC2 transcript levels

Total RNA (2 µg) from roots, leaves, or floral parts was reverse transcribed at 40 °C for 1 h in a
reaction cocktail containing 200 units of SuperScript II reverse transcriptase (RNase H-, Gibco BRL,
USA), 10 units RNase inhibitor (Perkin Elmer), 200 µM dNTPs and 40 pmol of primer, in a total
volume of 20 µl. For first strand cDNA synthesis, a single primer (5'-AGAAAAACATCACCAACT-3')
capable of hybridizing to both the ADC1 and ADC2 transcripts
was used in the reaction. As a control, a primer (5'-GCAACTGTCATCTTATCATCTTC-3')
specific for the β-subunit of the tobacco mitochondrial ATP synthase gene atp2 (Boutry and Chua,
1985) was used in the reverse transcriptase reaction.

Following reverse transcription, the single stranded cDNA products were serially diluted over a
concentration range corresponding to 1 to 50 ng RNA, and PCR amplification was carried out for 25 cycles
of 45 s at 94 °C, 1 min at 64 °C and 1 min at 72 °C in a Genemate thermocycler (ISC Bioexpress,
UT). The reaction mixture contained cDNA template, 1 x PCR buffer (Boehringer Mannheim), 100
µM dNTPs, 25 pmol each of forward and reverse primer and 1 unit Taq DNA polymerase. The PCR
reactions specific for ADC1 transcripts contained the following primers: ADC1-forward,
5'-CGTAGACGCTACTGTTTC-3' and ADC1-reverse, 5'-TGGACAACTGTGGAGGCG-3'.
Reactions specific for ADC2 transcripts contained primers ADC2-forward,
5'-TGTAGATGCTGCTGTTGTTT-3', and ADC2-reverse, 5'-TGAACAACTGCGGAGGCA-3'.
Control reactions for normalization of amplification products contained 25 pmol of primers specific
for the tobacco atp2 transcripts: atp2 forward, 5'-GTATATGGTCAAATGAATGAGCC-3', and atp2
reverse.int, 5'-GCAGTATTGTAGTGATCCTCTCC-3'. For quantitation purposes, amplification
reactions were supplemented with 1 µCi 32P-dCTP. PCR products were separated by electrophoresis
through 1.2% agarose gels, and the fractionated reaction products were transferred onto Hybond N+
membranes, dried and subjected to autoradiography at -70 °C. Quantitation was carried out by
phosphorimaging using a Molecular Dynamics PhosphorImager. Values were normalized relative to
the intensities of the atp2 control band in each lane. The experiment was conducted twice with
different total RNA samples, and representative results are presented from one of the two
experiments.
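
As a rough sanity check on the oligonucleotides used for reverse transcription and PCR, the sketch below reports the length, GC fraction and a Wallace-rule melting temperature estimate for each primer. The Wallace rule (2 °C per A/T, 4 °C per G/C) and the dictionary labels are assumptions introduced here for illustration; the text does not state how the primers were designed or how the 64 °C annealing temperature was chosen.

```python
# Primer sequences as printed in the text, with OCR spacing removed.
# The dictionary keys are labels chosen here for illustration only.
PRIMERS = {
    "ADC_RT": "AGAAAAACATCACCAACT",
    "atp2_RT": "GCAACTGTCATCTTATCATCTTC",
    "ADC1_forward": "CGTAGACGCTACTGTTTC",
    "ADC1_reverse": "TGGACAACTGTGGAGGCG",
    "ADC2_forward": "TGTAGATGCTGCTGTTGTTT",
    "ADC2_reverse": "TGAACAACTGCGGAGGCA",
    "atp2_forward": "GTATATGGTCAAATGAATGAGCC",
    "atp2_reverse_int": "GCAGTATTGTAGTGATCCTCTCC",
}


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a primer."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


def wallace_tm(seq: str) -> int:
    """Wallace-rule estimate: 2 degrees C per A/T plus 4 degrees C per G/C."""
    seq = seq.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


def summarize(primers: dict) -> dict:
    """Map each primer name to (length, GC fraction, Wallace Tm)."""
    return {
        name: (len(seq), gc_fraction(seq), wallace_tm(seq))
        for name, seq in primers.items()
    }


if __name__ == "__main__":
    for name, (length, gc, tm) in summarize(PRIMERS).items():
        print(f"{name:18s} {length:3d} nt  GC={gc:.2f}  Tm~{tm} C")
```

The Wallace rule is only a coarse estimate for short oligonucleotides, so the values are best read as a relative comparison between primer pairs rather than as predicted annealing temperatures.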

Results

These studies show the structure and expression of individual members of the ADC gene
family in tobacco. An [α-32P]-dCTP-labeled 2.7 kb EcoRI-XhoI fragment from PR24 encoding the
ADC coding region was used to screen a λ EMBL3 phage genomic library. From a screen of
approximately 3 x 10^5 phage, seventeen hybridizing phage were recovered, of which five were fully
characterized by restriction mapping and DNA gel blot analysis. These phage fell into two groups
based on their restriction profiles. The relevant hybridizing fragments from the various phage were
cloned into pBluescript and their nucleotide sequences determined.

Presented in Figure 7 are the nucleotide and predicted amino acid sequences of the NtADC-1 and
NtADC-2 genes. Both genes contain a single open reading frame, uninterrupted by introns. The
nucleotide and amino acid sequence encoded in NtADC-1 is identical to that of PR24, the full length
cDNA isolated from N. tabacum cv. Burley 21. There are 84 nucleotide differences within the
NtADC-1 and NtADC-2 coding regions, resulting in 23 amino acid differences between the ADC1
and ADC2 proteins. The ADC1 protein is one amino acid shorter in length, missing
Val-13.
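
Difference counts of this kind can be obtained by comparing two aligned sequences column by column. The following is a minimal sketch under stated assumptions: the function names are hypothetical, the two short sequences are toy examples rather than the NtADC-1 and NtADC-2 sequences, and a gap column is simply counted as a difference.

```python
def count_differences(seq_a: str, seq_b: str) -> int:
    """Count mismatched columns between two aligned, equal-length sequences.

    A gap character in one sequence opposite a residue in the other is
    counted as a difference in this sketch.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(1 for a, b in zip(seq_a.upper(), seq_b.upper()) if a != b)


def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percentage of aligned columns that are identical."""
    differences = count_differences(seq_a, seq_b)
    return 100.0 * (len(seq_a) - differences) / len(seq_a)


# Toy aligned fragments (hypothetical, for illustration only).
toy_a = "ATGCCGGCCCTAGGT"
toy_b = "ATGCCGGCTCTAGGT"

if __name__ == "__main__":
    print(count_differences(toy_a, toy_b), "difference(s)")
    print(f"{percent_identity(toy_a, toy_b):.1f}% identity")
```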

By comparison to the full-length cDNA, the 5'-untranslated regions (UTRs) present in
NtADC-1 and NtADC-2 are 431 bp and 432 bp long, respectively. The size of the 5'-UTR in the
ADC transcripts is considerably larger than the average size of the plant leader sequence (Joshi,
1987). In contrast, the 3'-UTRs present in NtADC-1 and NtADC-2 are relatively short,
approximately 84 nucleotides in length. In both gene sequences, a conserved polyadenylation signal
(AATAATA) can be recognized 23 nucleotides from the polyadenylation site found in the
PR24 cDNA.
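
Locating such a signal relative to the end of a transcript can be sketched as a simple motif search. The 3'-UTR below is a toy sequence constructed for illustration, not the NtADC sequence, and measuring the distance from the end of the motif to the end of the sequence is an assumption about how the 23-nucleotide spacing is counted.

```python
def find_motif(seq: str, motif: str) -> list:
    """Return 0-based start positions of every occurrence of motif in seq."""
    seq, motif = seq.upper(), motif.upper()
    positions = []
    start = seq.find(motif)
    while start != -1:
        positions.append(start)
        start = seq.find(motif, start + 1)  # allow overlapping matches
    return positions


def distance_to_end(seq: str, motif_start: int, motif: str) -> int:
    """Nucleotides between the end of the motif and the end of the sequence."""
    return len(seq) - (motif_start + len(motif))


POLYA_SIGNAL = "AATAATA"
# Toy 3'-UTR (hypothetical): signal followed by 23 nucleotides.
toy_utr = "GCTTGC" + POLYA_SIGNAL + "TTGTCTAGCTTGATCCGTATGCA"

if __name__ == "__main__":
    for pos in find_motif(toy_utr, POLYA_SIGNAL):
        print(pos, distance_to_end(toy_utr, pos, POLYA_SIGNAL))
```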

Pairwise comparison of the N. tabacum ADC1 and ADC2 proteins with the ADCs of other
plant species showed that the N. tabacum proteins are approximately 82% identical to the ADC of
its evolutionary progenitor species N. sylvestris [Genbank Accession No. AB012873] and 86%
identical to the ADC from tomato (Lycopersicon esculentum) [31], another member of the
Solanaceae family (Figure 2). As might be expected, the N. tabacum ADC shares considerably less
similarity with ADCs isolated from species more distantly related evolutionarily, such as Arabidopsis
(67% identical) [32, 33], soybean (67% identical) [34], and oat (42% identical) [35], and is only 29%
identical to the enzyme from Escherichia coli [36].

The predicted protein coding regions for the N. tabacum ADCs are substantially longer than
those reported for the ADC proteins of N. sylvestris and L. esculentum [31], but are similar in length
to those reported in Arabidopsis, oat, soybean [32-35] and for the E. coli enzyme [36]. The
difference in overall length appears to arise from an apparent nucleotide deletion in the N. sylvestris
and tomato cDNA sequences relative to the ADC1 and ADC2 predicted sequences and those in other
plants. In the nucleotide sequences reported for both the N. sylvestris and tomato cDNAs, a guanine
residue (position 2295 in the N. sylvestris sequence and 1531 in the tomato sequence) is missing
[Genbank Accession No. AB012873]. This deletion changes the reading frame and introduces a
premature termination to the predicted coding region. Using the sequence information available in
the NCBI database, correcting for this error allowed us to extend the predicted C-terminus of
both ADC proteins, yielding the alignment to the N. tabacum ADCs and those of other plant ADCs
as indicated in Figure 8. We have also included in the alignment shown in Figure 8 the correction at
the N-terminus of the predicted tomato ADC protein sequence noted by Perez-Amado et al. [37],
allowing better alignment of all of the higher plant sequences.
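
The effect of a single missing base on a predicted coding region can be illustrated with a small translation sketch. The sequences below are toy examples built for this purpose, not the N. sylvestris or tomato cDNAs, and the helper names are hypothetical; the codon table is the standard genetic code.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate from the first base until a stop codon or the end of seq."""
    protein = []
    seq = seq.upper()
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":  # stop codon terminates the reading frame
            break
        protein.append(residue)
    return "".join(protein)


def restore_base(seq: str, index: int, base: str) -> str:
    """Insert a base at a 0-based index, e.g. to restore a missing residue."""
    return seq[:index] + base + seq[index:]


# Toy "reported" sequence lacking one G (hypothetical, for illustration).
reported = "ATGGCAGTGACCTAAGCTTGGATTGA"
corrected = restore_base(reported, 6, "G")

if __name__ == "__main__":
    print("reported :", translate(reported))   # truncated by a premature stop
    print("corrected:", translate(corrected))  # reading frame restored
```

In the toy case, restoring the single guanine moves the stop codon downstream and lengthens the predicted C-terminus, which is the same kind of correction described in the text for the two reported cDNA sequences.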

Developmental regulation of arginine decarboxylase expression 

It has been shown that nicotine formation can be activated in the roots of maturing tobacco
plants by topping, that is, removal of the flower head and several young leaves (Akehurst, 1981;
Hibi et al., 1994). Coincident with the activation of nicotine formation, there is an increase in the
levels of transcripts encoding ODC, PMT and spermidine synthase (SPS) over the subsequent 24 hr
period in wild-type plants (Hibi et al., 1994; Riechers and Timko, 1999). To determine the effects of
topping on ADC expression in roots, Burley 21 plants were grown in the greenhouse to the bud stage,
at which point the upper 1/3 of the plant was removed and samples of root tissues were collected
before and at various times post-topping. As shown in Figure 9, ADC message abundance increased
in the roots of topped Burley 21 plants during the 24 hr period after topping. Low alkaloid (LA)
mutants of Burley 21 show a much lower level of ADC expression in their roots, and no induction of
ADC transcript accumulation after topping. The lack of ADC induction in the low-alkaloid mutant is
consistent with previous studies (Hibi et al., 1994; Riechers and Timko, 1999; Wang, 1999) showing
a general inability to activate gene expression leading to increased polyamine formation and alkaloid
biosynthesis as a result of the mutation of the Nic1 and Nic2 regulatory genes.

NtADC-2 is predominantly expressed in roots of wild-type plants.

Due to the high degree of identity between the NtADC-1 and NtADC-2 transcripts (95.8% in the
coding regions, and 94.4% and 96.4% in the 5'- and 3'-UTRs, respectively), it is impossible to distinguish
between the two transcripts by RNA gel blot analysis. Therefore, we employed an RT-PCR based
strategy and gene specific oligonucleotide primers. Total RNA was extracted from tobacco roots,
leaves and flowers, and single-stranded cDNA was synthesized using an oligonucleotide primer capable
of hybridizing to both ADC1 and ADC2 transcripts. As an internal control for amplification, a gene
specific primer recognizing the atp2 transcript encoding the β-subunit of the tobacco mitochondrial
ATPase was included in the reactions. Under experimental conditions providing amplification in the
linear range of the PCR reaction, gene specific forward and reverse primers were used to specifically
amplify either ADC1 or ADC2 cDNAs. Test reactions (Figure 10A) using plasmid DNA encoding
NtADC1 or NtADC2 as template demonstrated the specificity of the primers. As shown in Figure
10B, the main transcripts detectable in all tissues tested are derived from NtADC-2. Flowers express
the highest level of ADC, and leaves the lowest. In the flowers, ADC1 is detectable, but at far lower
levels than ADC2. Roots also express a significant level of ADC.
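
The normalization of gene specific signals to the atp2 control band in each lane can be sketched as a per-lane ratio. The counts below are hypothetical phosphorimager values in arbitrary units, chosen only to show the arithmetic; they are not data from these experiments, and the function name is an assumption.

```python
def normalize_to_control(target: dict, control: dict) -> dict:
    """Divide each lane's target band intensity by the control band in the same lane."""
    normalized = {}
    for lane, signal in target.items():
        reference = control[lane]
        if reference <= 0:
            raise ValueError(f"control band for lane {lane!r} must be positive")
        normalized[lane] = signal / reference
    return normalized


# Hypothetical band intensities (arbitrary units), not data from the study.
adc2_counts = {"root": 200.0, "leaf": 50.0, "flower": 450.0}
atp2_counts = {"root": 100.0, "leaf": 100.0, "flower": 150.0}
relative_adc2 = normalize_to_control(adc2_counts, atp2_counts)

if __name__ == "__main__":
    for lane, value in relative_adc2.items():
        print(f"{lane}: {value:.2f}")
```

Dividing by the control band within the same lane corrects for lane-to-lane differences in template input and loading, so that tissues can be compared on a common scale.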

ADC transcript levels are highest in the roots and floral organs, and low in other plant tissues. 
The two ADC genes investigated appear to have different modes of regulation, with ADC2 being 
predominately expressed in the roots and other organs. 

At the present time, only limited information is available on the nature of regulatory regions 
in the promoters of genes encoding enzymes of alkaloid biosynthesis. The availability of cloned 
genomic fragments encoding ADC allows one to begin mapping regulatory sequences within 
members of these genes responsible for tissue specific, developmental, and inducible expression. 



III. ODC

The present invention features the genes of two members of the nuclear gene family encoding
ODC in tobacco, N. tabacum. As the following experimental discussion shows, the ODC-2 gene is
preferentially expressed in roots and floral tissues. Furthermore, the abundance of ODC transcripts in
root tissues is affected by topping. Finally, analysis of ODC transcript levels in roots of low
and high nicotine producing lines shows that ODC expression is under the control of the Nic1 and Nic2
regulatory loci.

Materials and methods

Plant growth and tissue preparation 

Seeds of N. tabacum cv. Xanthi, wild-type and low alkaloid nic1 nic2 mutant N. tabacum cv. Burley
21 were obtained from Dr. G. Collins (University of Kentucky, Lexington). Tobacco plants used for
DNA isolation were grown in soil:vermiculite mixture in the greenhouse under natural lighting
conditions. Plants used for RNA extraction were grown either in Moltan Plus (Moltan Co.,
Middleton, TN) or hydroponically in a dilute (half-strength) Peters nutrient solution with continuous
aeration of the roots under natural lighting conditions in the greenhouse. Topping experiments were
conducted by removing the floral meristem, leaves and stem (approximately the upper 1/3 of the
plant) from tobacco plants just prior to blooming. Floral parts and other plant tissues were collected
from fully matured individuals, frozen in liquid nitrogen, and stored at -80 °C until used for RNA
preparation (see below).

Screening of genomic libraries and phage characterization 

A genomic library constructed in EMBL3 from N. tabacum cv. Xanthi leaf DNA (Clontech, Inc.,
Palo Alto, CA) was screened by plaque hybridization (Sambrook et al., 1989) using a 32P-radiolabeled,
1.6 kb EcoRI-XhoI insert from plasmid PR46 as probe. PR46 encodes a full length
ODC cDNA previously isolated by differential screening of plasmid libraries prepared from mRNA
isolated from the roots of wild-type Burley 21 plants before and 3 days post-topping (Wang, J.,
Sheehan, M., Bookman, H. and Timko, M.P., unpublished data). Hybridization was performed at
65 °C for 16 h in a solution containing 0.25 M Na2HPO4 (pH 7.2) and 7% (w/v) SDS. Following
hybridization, the membranes were washed twice in 2 x SSC, 0.1% SDS for 15 min at room
temperature, and once in 0.2 x SSC, 0.1% SDS for 30 min at 65 °C. Hybridizing phage were picked and
plaque purified through three subsequent rounds of hybridization screening. Phage DNA was isolated
from plaque purified phage using a Qiagen Phage Midi Preparation Kit (Qiagen USA, Valencia, CA)
and insert DNA was characterized by restriction mapping and DNA gel blot analysis. The relevant
hybridizing bands in each phage were cloned into pBluescript SK+ vectors for further analysis.

Nucleic acid sequencing and analysis 

Nucleotide sequencing was carried out manually using Sequenase Version 2.0
according to the manufacturer's protocol (United States Biochemical, Cleveland, OH) or with an ABI
310 Genetic Analyzer (PE Applied Biosystems, Foster City, CA) using double-stranded plasmid
DNA templates prepared with the Qiaprep Spin Plasmid Kit (Qiagen USA, Valencia, CA). The
nucleotide and predicted amino acid sequences of the various cDNAs were analyzed using BLAST
sequence analysis programs (Altschul et al., 1990; Gish and States, 1993), and protein sequence
alignments were carried out using the PILEUP program (Genetics Computer Group Sequence
Analysis package, Version 9.0; GCG, University of Wisconsin, Madison, WI) and the various gene
sequences available in the NCBI (National Center for Biotechnology Information, Bethesda, MD)
nucleotide and protein sequence databases. Manual adjustment of the sequence alignments was
carried out as necessary.

RNA isolation and gel blot analysis 

Total RNA was extracted from tobacco roots, leaves, and floral parts using Tri-Reagent
(Molecular Research Center, USA, Cincinnati, OH) according to the manufacturer's protocol. For
RNA gel blot analysis, aliquots (10 µg) of total RNA extracted from the various tissues were
fractionated by electrophoresis through a 1.2% agarose-formaldehyde gel and blotted onto Nytran
nylon membranes (Schleicher & Schuell, Keene, NH) using 10 X SSC. The transferred RNA was
UV cross-linked to the membrane using a UV Stratalinker (Stratagene, La Jolla, CA) and the
membranes were prehybridized in 7% SDS, 0.25 M Na2HPO4, pH 7.2 for 2-4 hours at 65 °C.
Hybridization was carried out in the same buffer in the presence of 32P-labeled probes for 16 hr at
65 °C. The membranes were washed under high stringency conditions and subjected to
autoradiography at -80 °C for approximately 48 h.

Restriction fragments derived from cDNA clones of interest were separated by agarose gel
electrophoresis, and the DNA was purified and quantified by spectrophotometry. [32P]-dCTP-labeled
probes were prepared from 25-50 ng of insert DNA by random primed labeling (Random Primed
Labeling Kit, Boehringer Mannheim, Indianapolis, IN). As a control, the blots were also probed with
radioactively labeled probes encoding the alkaloid biosynthesis enzyme putrescine
N-methyltransferase (PMT) (Riechers and Timko, 1999), a root specific, topping inducible
β-glucosidase encoding cDNA (TBG-1) (Riechers, D.E. and Timko, M.P., unpublished data), 26S
rRNA (PR31) or 28S rRNA fragments.

Genomic DNA isolation and gel blot analysis 

Tobacco genomic DNA was prepared from tobacco leaf tissue by the method of Junghans and
Metzlaff (1990). Total genomic DNA (15 µg) was digested to completion with EcoRI or HindIII, the
digestion products were fractionated by electrophoresis through a 0.8% (w/v) agarose gel, and
transferred onto Nytran nylon membrane (Schleicher & Schuell, Keene, NH) in the presence of 0.4 N
NaOH (Sambrook et al., 1989). Following transfer, the membrane was rinsed in 2 X SSC, the DNA
was UV cross-linked to the membrane, and the membrane was prehybridized and hybridized as
described above. Following hybridization and washing, the membranes were subjected to
autoradiography at -80 °C.

Results and discussion 

Gel blot analysis of tobacco genomic DNA cut with various restriction enzymes and
hybridized with an [α-32P]-dCTP-labeled 1.6 kb EcoRI-XhoI cDNA fragment (PR46) encoding the
full-length ODC protein from N. tabacum cv. Burley 21 (Wang, J., Sheehan, M., Bookman, H. and
Timko, M.P., unpublished data) indicated that ODC is encoded by a small gene family in the N. tabacum
genome (Fig. 11). Four to five major bands and several minor bands of sufficient size to encode
full-length genes are detected in either EcoRI or HindIII digested tobacco DNA.

To further analyze the structure and regulation of members of the ODC gene family in
tobacco, a λ EMBL3 phage genomic library constructed with DNA from N. tabacum cv. Xanthi was
screened using [α-32P]-labeled probes prepared from PR46 (as described above). From a screen of
approximately 3 x 10^5 phage, five hybridizing phage were recovered, of which three were fully
characterized by restriction mapping and DNA gel blot analysis. Two phage proved to contain
identical insert DNA and the third had a unique restriction digestion profile. Following DNA gel blot
analysis, the hybridizing fragments were cloned into pBluescript and their nucleotide sequences
determined.

The complete NtODC-2 gene spans two SalI fragments of 2.7 kb and 6.5 kb. The coding
region of the gene contains a single 1302 bp open reading frame uninterrupted by introns (Fig. 12).
The nucleotide sequence of NtODC-2 is identical within the coding and 5'- and 3'-untranslated
regions to the PR46 encoded cDNA, with the exception of four nucleotide changes (residues +2, +4,
+6 and +8) in the 5'-untranslated region. These nucleotide differences likely reflect changes
introduced during the cDNA synthesis reaction.

The predicted amino acid sequence for the NtODC-2 encoded protein (designated pODC2)
(Fig. 13) is identical to the ODC characterized from Burley 21 tobacco encoded by PR46 (Wang, J.,
Sheehan, M., Bookman, H. and Timko, M.P., unpublished data) and to the partial N. tabacum ODC
cDNA sequence (PR17) reported by Malik et al. (1996). Comparison of the predicted amino acid
sequence for pODC2 with the ODC proteins characterized from two different tobacco cultivars
showed that pODC2 differs by 7 amino acids (98% identity) from the ODC protein characterized
from the high alkaloid cultivar, N. tabacum cv. SC58 [Genbank Accession No. Y10472.1] and by 7
amino acids (98% identity) from the ODC protein from BY-2 cells. The tobacco pODC2 is 89% and
90% identical to the ODCs from tomato (Lycopersicon esculentum) and jimsonweed (Datura
stramonium), respectively, but substantially less similar to ODCs from yeast (35% identity) and
humans (32% identity).
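
The quoted 98% identity is consistent with simple arithmetic on the figures given in the text. The sketch below assumes that the 1302 bp open reading frame includes the stop codon, so that the protein has 433 residues, and that the compared proteins are aligned over that full length without gaps; neither point is stated explicitly in the text.

```python
def percent_identity_from_differences(length: int, differences: int) -> float:
    """Percent identity for an ungapped alignment of the given length."""
    return 100.0 * (length - differences) / length


ORF_BP = 1302                 # open reading frame length given in the text
CODONS = ORF_BP // 3          # 434 codons
RESIDUES = CODONS - 1         # assumption: the last codon is the stop codon
DIFFERENCES = 7               # amino acid differences given in the text

identity = percent_identity_from_differences(RESIDUES, DIFFERENCES)

if __name__ == "__main__":
    print(f"{RESIDUES} residues, {DIFFERENCES} differences: {identity:.1f}% identity")
```

If the stop codon were not counted in the 1302 bp, the protein would have 434 residues and the result would still round to 98%.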

The NtODC-1 gene, contained on a 4.0 kb XbaI fragment, encodes a single open reading
frame of 141 amino acids encompassing the amino terminal one-half of ODC (Fig. 12). Six amino
acid residue changes distinguish the NtODC-2 and NtODC-1 encoded proteins over the homologous
region of the proteins. Beginning at amino acid residue 130, the NtODC-1 encoded protein (pODC1)
diverges from pODC2, with a stop codon present after residue 141. Scanning the available
nucleotide sequence (> 1 kb) in the 3'-flanking region of the NtODC-1 gene failed to reveal any
evidence for ODC homologous protein sequences in any of the three translational reading frames.
Interestingly, a comparison of the 5'-flanking sequences of the NtODC-1 and NtODC-2 genes
revealed that while the NtODC-2 gene has a clearly recognizable TATA-box properly located at
approximately -35 bp from the transcriptional start site, no such regulatory motif is found in the
NtODC-1 gene sequence. Consistent with this observation, RNA gel blot analysis performed using a
hybridization probe prepared from NtODC-1 immediately downstream of the frame shift failed to
detect any message in various tissues of mature tobacco plants (data not shown). Thus, it appears that
NtODC-1 represents an unexpressed pseudogene in the N. tabacum genome.

To determine the spatial pattern of expression of the NtODC-2 gene, gel blot analysis was 
carried out using total RNA prepared from roots, stems, young and mature leaves, and various floral 
parts of Burley 21 tobacco plants. As shown in Fig 14, transcripts encoding ODC were easily 
detected in the roots, with little or no expression in other tissues except sepals, carpels, and mature 
stamens. 

The formation of nicotine and total leaf alkaloids in tobacco is known to be under the control
of at least two independent genetic loci (Legg et al., 1969; Legg and Collins, 1971), designated Nic1
and Nic2 (Hibi et al., 1994). Nic1 and Nic2 are semidominant and operate synergistically to control
plant alkaloid content, with mutations within these genes resulting in plants with reduced levels of
nicotine and total leaf alkaloids (wild-type > nic1 > nic2 > nic1 nic2) (Legg et al., 1969; Legg and
Collins, 1971). Although no information is available on the nature of their encoded products, it has
been speculated that Nic1 and Nic2 likely encode transcriptional regulators capable of globally
interacting with a subset of genes encoding components of polyamine and alkaloid biosynthesis
(Hibi et al., 1994). Removal of the flower head and several young leaves (i.e., topping) leads to
activation of nicotine formation in the roots of decapitated plants (Akehurst, 1981; Hibi et al., 1994).
To determine the effects of topping on NtODC-2 expression in roots, Burley 21 plants were grown in
the greenhouse to the bud stage, at which point the upper 1/3 of the plant was removed and samples
of root tissues were collected before and at various times post-topping. As shown in Fig. 14B, low
levels of the ODC transcripts were found in roots prior to topping, and message abundance increased
approximately 2-fold in the roots of topped Burley 21 plants 4 hr after topping. By 24 hr after
topping, ODC transcript levels returned to their initial levels. Low alkaloid mutants of Burley 21
subjected to the same treatment show a much lower level of stimulation of ODC transcript
accumulation after topping, and the enhanced transcript abundance does not persist beyond 4 hr. By
comparison, transcripts encoding PMT and a tobacco root-specific β-glucosidase (TBG-1) show
patterns of accumulation similar to that observed for ODC transcripts in wild-type plants, but no
induction in the low-alkaloid mutant, consistent with previous studies (Hibi et al., 1994; Riechers
and Timko, 1999; Wang, 1999).


IV. SAMS 

A single recombinant phage is identified as encoding SAMS. This λ phage contains an
approximately 15 kb SalI insert. Restriction mapping and PCR analysis indicate that the insert
DNA contains primarily the coding and 3' non-coding portions of the SAMS gene. The nucleotide
sequences for the gene encoding SAMS can be found at GenBank Accession Nos. AF27243 (full
length SAMS cDNA).

V. NADH dehydrogenase 

A fragment of the cDNA encoding NADH dehydrogenase in N. tabacum shows high
expression in the roots of mature wild-type HP plants compared to low alkaloid mutant LP plants.

VI. Phosphoribosylanthranilate isomerase (PAI)

The gene encoding a fragment of phosphoribosylanthranilate isomerase in N. tabacum is
a homolog of the Arabidopsis thaliana gene encoding PAI, an enzyme involved in tryptophan
biosynthesis. This enzyme is involved in the overall formation of aromatic compounds in plants.

What is claimed is:

1. An isolated DNA molecule comprising the nucleotide sequence of (SEQ. ID. NO. 2), (SEQ. ID.
NO. 5), (SEQ. ID. NO. 8), (SEQ. ID. NO. 11), (SEQ. ID. NO. 13), (SEQ. ID. NO. 15), (SEQ. ID.
NO. 18), (SEQ. ID. NO. 21), (SEQ. ID. NO. 23), (SEQ. ID. NO. 25) or (SEQ. ID. NO. 26) or
comprising a nucleotide sequence encoding the amino acid sequence encoded by (SEQ. ID. NO. 3),
(SEQ. ID. NO. 6), (SEQ. ID. NO. 9), (SEQ. ID. NO. 12), (SEQ. ID. NO. 14), (SEQ. ID. NO. 16),
(SEQ. ID. NO. 19), (SEQ. ID. NO. 22) or (SEQ. ID. NO. 24).

2. A vector comprising the isolated DNA molecule of claim 1 operably linked to sequences capable
of directing the transcription of a mRNA encoded by said isolated DNA molecule.

3. An isolated DNA molecule comprising a DNA sequence complementary to the nucleotide
sequence of claim 1.

4. A vector comprising the isolated DNA molecule of claim 3 operably linked to sequences capable
of directing the transcription of a mRNA encoded by said isolated DNA molecule.

5. A cultured transgenic tobacco cell stably transformed with the vector of claim 2.

6. A cultured transgenic tobacco cell stably transformed with the vector of claim 4.

7. A transgenic tobacco plant stably transformed with the vector of claim 2.

8. A transgenic tobacco plant stably transformed with the vector of claim 4.

9. The isolated DNA molecule of claim 1, wherein the isolated DNA molecule comprises the
nucleotide sequence of (SEQ ID NO:).

10. A vector comprising the isolated DNA molecule of claim 9 operably linked to sequences capable
of directing the transcription of a mRNA encoded by said isolated DNA molecule.

11. An isolated DNA molecule comprising a DNA sequence complementary to the nucleotide
sequence of the isolated DNA molecule of claim 9.

12. An isolated DNA sequence comprising about a fifteen to about a twenty-five base pair
oligonucleotide sequence identical to any consecutive about fifteen to about twenty-five base pair
sequence found in (SEQ. ID. NO. 2), (SEQ. ID. NO. 5), (SEQ. ID. NO. 8), (SEQ. ID. NO. 11),
(SEQ. ID. NO. 13), (SEQ. ID. NO. 15), (SEQ. ID. NO. 18), (SEQ. ID. NO. 21), (SEQ. ID. NO. 23),
(SEQ. ID. NO. 25) or (SEQ. ID. NO. 26).

13. A cultured transgenic tobacco cell stably transformed with the vector of claim 10.

14. A transgenic tobacco plant stably transformed with the vector of claim 10.

15. A vector comprising a DNA sequence which encodes an antisense mRNA which is
complementary to a fragment of a mRNA encoded by the isolated DNA molecule of claim 1,
wherein said sequence is operably linked to sequences capable of directing the transcription of said
antisense mRNA in tobacco cells and wherein the expression of said antisense mRNA in tobacco
cells is sufficient to provide for reduced nicotine content in tobacco cells which are stably
transformed with said vector as compared to untransformed tobacco cells.

16. A cultured transgenic tobacco cell stably transformed with the vector of claim 15.

17. An isolated and purified protein comprising the amino acid sequence identified in (SEQ. ID. NO.
3), (SEQ. ID. NO. 6), (SEQ. ID. NO. 9), (SEQ. ID. NO. 12), (SEQ. ID. NO. 14), (SEQ. ID. NO. 16),
(SEQ. ID. NO. 19), (SEQ. ID. NO. 22) or (SEQ. ID. NO. 24).

18. A method for regulating gene expression in a plant comprising functionally linking an alkaloid
gene promoter to a nucleic acid encoding a protein, wherein the promoter comprises a nucleic acid
sequence selected from the group consisting of the sequences identified in (SEQ. ID. NO. 1), (SEQ.
ID. NO. 4), (SEQ. ID. NO. 7), (SEQ. ID. NO. 10), (SEQ. ID. NO. 17), and (SEQ. ID. NO. 20).

19. The method of claim 18, wherein the nucleic acid encoding a protein encodes a protein involved
in the biosynthesis of alkaloids in plants.

20. A plant transformed by the method of claim 18.



WO 00/67558 PCT/US00/12450 

1/21 



[Figure sheet 1/21: DNA gel blot image. Recoverable labels: lanes KpnI, KpnI + EcoRI and EcoRI; size markers 9.4, 6.8, 5.0, 4.0, 3.0, 2.5 and 2.0; bands NtPMT1b, NtPMT4, NtPMT3, NtPMT2 and NtPMT1a.]



2/21

[Figure sheet 2/21: sequence alignment of NtPMT4, NtPMT3, NtPMT1a, NtA411 and NtPMT2, with blocks labeled Exon 1, Exon 2 and Exon 4 through Exon 8. The aligned residues are illegible in the source image.]

3/21

[Figure sheet 3/21: gel image. Recoverable labels: 420 bp, NtPMT-1b, STD, NtPMT1 and NtPMT2; a second size label is illegible in the source image.]



4/21

NtPMT3
NtPMT2
NtPMT4
NtPMT1a



-gc tgtacaaaag gatgtctcaa atcatttgga atattaattc -1029 
— — — — ctgagttg -1039 



NtPMT3
NtPMT2
NtPMT4
NtPMT1a



O-box 

tgcaatcaac aagaaatacc ccactattaa gacccattat cactggcaca aaaattatga gatca ttaaa catc ttaaac -949 
acaagaacaa ttcctggtga atcagatgga tgaagataat agaggtgggt ggaatctata accaaafjcag ctggptgagt -959 



NtPMT3
NtPMT2
NtPMT4
NtPMT1a



ctgtcectat ttggaagagt gtggtatggg agatgcctcc caggagtacc taaagctgaa tatgatggaa gttttaacaa -669 
gactgtgcga gttgcagaaa caattgaagg gtcatttgtg gaatttgggg ccatttcaaa ggaaaaagaa aagatgactt -679 



NtPMT3
NtPMT2
NtPMT4
NtPMT1a



acaaattggg aagcagggat tgagggattc tcagagatga agagggaggc tttgtcatgg ctttttcgat gcctataatc -7B9 
agcattaata aatcaaatta aaataaggct tagcgttaaa atcaaaggaa atggcaagcc tggctcctgg agcaatgctt -799 



NtPMT3
NtPMT2
NtPMT4
NtPMT1a



tataataaca tcagtgaagc agaattgaaa gccatcaagt atgggtgtga atggtgcaaa tacaaaggaa tatcaaactt -709 
ctgaggacag tagtaaaaac aatatcagac aaaaagtaaa gttgtattat ttagcttgag gataaagtat gtcattagtt -719 



NtPMT3
NtPMT2
NtPMT4
NtPMT1a



NtPMT3
NtPMT2
NtPMT4
NtPMT1a



NtPMT3
NtPMT2
NtPMT4
NtPMT1a



cattgtggaa 
ttgtgagaga 



ccgagaaatt 
caagcccctc 



actgactcga ggatgatcta tgacatacta cagaccaaaa 
tttggtgtcc tctacaatga ttgttgaagt ccctatttat 



aagcaa 
atacac 



caacaagttg aaacaagaga 
aggaaacaaa atcctaggat 



-629 
-639 



-g Jgat||caatgg agaaggaaaa tatttccagt -625 



aatggagatt 
ttaaatgaca 



G-box

ctggacacct gcaggacacc tgttacccat tgccttcgcg
ataatggggt taatgatgaa tatgtagcgg catgacatga 



aagcaaatca agtggcagac
atgccaaaat tctccgcaac



-549 
-559 



gtaaacacaa gtgaatgaag agaagccaaa ataatctcta tcattcaagc cttaggtgga gattaaaaaa attatttact -545

PAL 



tggtttgcta 
gactatttat 



aagaggccac
ttaatattga



cagagctaac gaaggt 
ggaatatttt ttatta 



tea gtcae aea^a ttrtagaea fe 

atj| gttatctggtg acaagcattc 



gtatcaaaag
gtttgcttcc



cggccaaggg
gttgattacg



-469 
-479 



ttcttatcaa agtaataggt gatcaacagc tttcgtaaaa cgtcattagg agaatattat aatctctttt atgctgaaga -465



atggatatgt
gatctactct



ggcaggtccc ttattttaga attagatatg aaaaatctaa
ataccaaccg aagccgttgt ccttgatctt cgcttgcatt



tttttttttg 
taattcatct 



taagttaatt 
tccgtgtgcc 

Gtggacc 

gctagctggt



-389
-399
-389
-385



NtADC1

ttcacgttctcttctcaattcccataaaagaaacccttcccftta'cf 
gtttccgtcctatttt - -ctcttcttctacgctt c 7 8 
NtADC2 

. . . . c 

tc. . .a c 

. ,t. . . 80 



319 



NtADC1

ctcttctgatatcaatatctgtatggtgtttttcttg 
ttcgaattttagatttgttttgcctttaatacctgta 
acctta 158 
NtADC2 

a 

t. . a. . 

160 

NtADC1

taattctctgtttaaaccaaaaacttagcttcttctg 
aagtcagggtggggatatttggatcgtgtaagagtgt 
gttaga 238 
NtADC2 - 



239 

NtADC1

aggtgattatcttttgattcagttccttttttgcttc 
ttttgagggggtagccggggcctcggcctcggcgggt 
tttaat 318 
NtADC2 

g • • 



NtADC1

agcccccatctattacaaccattgggcaaaaacatca 
ttaaatctgtacaaagcaaacccttaatttagtttaa 
ttttct 398 
NtADC2 

r t 

a 

399 



MPALGCCVDA T- 
V S P P 
NtADC1

gtattctttgattctttaacagaagaagaagagATGC 

CGGCCCTAGGTTGTTGCGTAGACGCTACT 

CTTTCCCCTCC 475 
NtADC2 

a t ATG. 

T T. . .G. .GTT 

479 

. 1 

* * * * 

16LGYAFSRDSS 
LPAPEFFTSGVPP 
T N S A 
NtADC1

CTCGGCTATGCCTTCTCTCGGGATAGCTCTCTTCCCG
CGCCGGAGTTCTTTACCTCCGGCGTACCTCCTACAAA
CTCCG 555
NtADC2

. . .A 

G G 

. . .T. 559 

^7 *s******** 
*******£***** 

* * * * 

43 AGSIGSPDLS 
SALYGVDG WGAPY 
F S V 

NtADC1

CCGCCGGTTCCATTGGGTCTCCGGATCTGTCCTCTGC 
TTTGTACGGGGTCGATGGGTGGGGAGCTCCTTATTTC 
TCCGTT 635
NtADC2

T.C T. . . .G 

. . .A 

. .T. . . 639 

44 * A * * * * * " * * * 
************* 

* * * 

69NSNGD I S V. R P 
HGTD TLPHQEIDL 
L K V V 
NtADC1

AACTCTAACGGAGATATCTCCGTCCGACCACATGGTA 
CGGACACACTCCCCCACCAGGAAATTGACCTTCTCAA 
GGTCGT 715 



NtADC2 

T C 

T T 

719 

70* **** * * * * * 
***** ******** 

* * * * 

96 KKASD P KNS G" 
GLGLQLPLVVRFP 
D V L K 
NtADC1

GAAAAAGGCCTCCGACCCGAAAAATTCAGGGGGGCTC 

TGCTAA 795 
NtADC2 

T T 

..T.G. 799 

97********** 

************* 

* * * * 

• 123 NRLESLQ SAF 
D LAVH SQGYGA HY 
Q G V 
NtADC1

AAAACCGGTTGGAATCTCTGCAATCGGCTTTTGATCT 
CGCTGTTCATTCCCAGGGCTATGGGGCCCACTACCAA 
GGTGTT 875 
NtADC2 



(b)
. . .G

879 

124 ********** 
************* 

* * * 

149 YPVKCNQDRF 
VVEDIVKFGSSF R 
F G L E 

NtADC1

TATCCCGTGAAATGCAATCAAGACAGGTTCGTGGTGG 
AAGATATTGTCAAATTCGGGTCGTCATTCCGGTTCGG 
GTTGGA 955 
NtADC2 

C C 

959 

150 ********** 
******* * * * p * * 

* * * * 

176 AGSKPEL.LLA 
MSCLCRGSAEGLL 
V C N G 
NtADC1

AGCTGGGTCTAAACCCGAGCTCCTGTTAGCCATGAGC 
TGTCTCTGCAGGGGCAGTGCTGAGGGCCTTCTCGTTT 
GCAATG 1035 
NtADC2 

. . .C 

A 

1039 



^77 **** ****** 

*****K** ***** 

* * * * 

203 FKDAEYI SLA 
LVARKLMLNTVIV 
L E Q 

NtADC1

GTTTCAAGGACGCTGAGTACATTTCGCTTGCTTTGGT 

GAACAA 1115 
NtADC2 

G. . . 

1119 

204 ********** 

* * * * * * , * * * * * * * 

* * * 

229 EEE LDLVIDI 
SRKMAV R PVI GLR 
A K L . R 
NtADC1

GAGGAGGAGCTTGACCTTGTGATTGATATAAGCCGTA 
AGATGGCTGTTCGGCCCGTAATTGGACTTCGGGCTAA 
GCTCAG 1195 
NtADC2 

A. . 

T 

1199 

230* * * * * . * * * * * 

* jj * * + ****** ** 

* * * * 



(c)
256 TK HSGHFGST
SGEKGKFGLTTT Q 
I . V R V 
NtADC1

GACCAAGCATTCAGGCCATTTTGGATCCACTTCTGGA 
GAAAAAGGTAAGTTTGGGCTTACAACGACCCAAATTG 
TTCGTG 1275 
NtADC2 



1279 

257 ********* * 
*****.****** ** 
* * * * 

283 VKKLEESGM L 
DC LQL LHFHIGSQ 
IPS. 

NtADC1

TAGTGAAGAAGCTGGAAGAATCCGGAATGCTGGATTG 
CCTTCAGTTGCTGCATTTTCACATTGGATCTCAGATC 
CCTTCA 1355 
NtADC2 

.G. . A 

T 

T 1359 

284 ********** 



* * * 



309 TALLADGVGE 
AAQ.IYC EL IRL GA 
G M K F 

NtADC1

ACGGCGTTGCTTGCTGATGGTGTTGGTGAGGCTGCTC 
AGATTTATTGTGAATTAATCCGTCTTGGTGCGGGTAT 
GAAGTT 1435 
NtADC2 

G A A C 

G. A 

1439 

310 *G * * ****** 
**** ****V**** 
* * * * 



336 IDTGGGLGID 
YDGTKSCDSDVSV 
G Y G I 

NtADC1

CATTGATACTGGAGGTGGGCTCGGAATTGATTATGAT 
GGTACTAAATCATGTGATTCAGATGTCTCTGTTGGCT 
ATGGCA 1515 
NtADC2 

T T 

C T. 

1519 

337 * * i ****** * 
************* 
* * * * 



(d)
363 QEYASTVVQA
VQYVCDRKGVKHP 
VIC 

NtADC1

TTCAAGAATACGCCTCCACAGTTGTCCAGGCGGTTCA 
ATATGTTTGCGACCGTAAGGGCGTGAAGCACCCAGTG
ATTTGC 1595 
NtADC2 

. . T G .T 

A T A 

..C... 1599 

364 *** **£**** 
************* 
* * * 

389 SESGRAIVSH 
HSILI FE AVSASS 
H S C S 
NtADC1

AGCGAAAGTGGCAGGGCAATTGTTTCTCATCACTCAA 
TTCTGATTTTCGAAGCCGTGTCTGCTTCTAGTCACTC
ATGTTC 1675 
NtADC2 



... 1679 

390 ********** 



* * * * 

416 S S H L S S G G L Q 
SMAETLNEDA LAD 
Y R N L 



NtADC1

TTCTTCACATCTGTCTTCTGGTGGCCTCCAATCCATG 
GCGGAGACGCTCAATGAAGATGCCCTTGCTGATTACC 
GCAATT 1755 
NtADC2 



C 

1759 

417 * * * * * * * ' * * * 
************* 
* * * * 

443 SAAAVRGEYE 
TCVLYS. DQLKQRC 
V D Q 
NtADC1

TATCTGCTGCTGCAGTTCGTGGAGAGTACGAGACGTG 
TGTACTTTACTCTGATCAGTTGAAACAGAGATGTGTG 
GATCAG 1835 
NtADC2 

. T A. . 



1839 

444 * * * * * * * * * * 
**** **** ***** 
* * * 

469 FKEGS LGIEH 
LAA VDS I CDFVS K 
A M G A 
NtADC1

TTTAAAGAAGGGTCCTTGGGTATTGAACATCTTGCTG
CTGTTGATAGCATCTGTGATTTTGTATCAAAGGCTAT
GGGGGC 1915

NtADC2 



1919 

470 * * * * * . * * * * * 
************* 

* * * * 

496 ADPIRTYHVN 
L SIFTSIPDFWAF 
G Q L F 
NtADC1

TGCTGATCCTATCCGCACTTACCATGTGAATCTGTCA 
ATTTTCACTTCAATTCCTGATTTTTGGGCCTTTGGTC 
AATTGT 1995 
NtADC2 

G 



1999 

497 ***v****** 
************* 
* * * * 

523 PI VPIHRLDE 
KPAVRGILSDLTC 
DSD 
NtADC1

TTCCGATTGTTCCAATACACCGTTTAGATGAAAAGCC 
TGCAGTAAGGGGAATATTATCGGACTTGACTTGTGAC 
AGTGAT 2075 



NtADC2 

T C 

G ...:A 

2079 

524 ********** 
* * *********** 



549 GKVDKFIGGE 
SSL QLHELGSNGD 
G G G Y 
NtADC1

GGGAAGGTTGATAAGTTCATTGGTGGCGAATCAAGCT 
TGCAGCTGCATGAATTGGGAAGTAATGGCGATGGTGG 
TGGGTA 2155 
NtADC2 



. . .C. . -A 

...T.. 2159 

55Q ********** 

***p********* 
* * * * 

576 YLGMFLGGAY 
EEALGGLHNLF GG 
p S V V 
NtADC1

TTATCTGGGGATGTTTTTGGGTGGGGCTTATGAGGAG 
GCGCTCGGAGGACTCCACAACCTGTTTGGTGGACCAA 
GCGTGG 2235 
NtADC2
.T. .C. 2239

577 ********** 
********** *** 

* * * * 

603 RVVQSDSAHS 
F AMSRSVPGP SCA- 
D V L 

NtADC1

TGCGCGTGGTGCAGAGCGATAGCGCTCACAGCTTCGC 
CATGTCTCGCTCCGTCCCTGGCCCGTCCTGCGCGGAC 
GTGCTC 2315 
NtADC2 

T. . 

... .A T T.T. 

2319 

604 * * * * * * * * * * 
***X********* 

* * * 

629 RAMQHE PEL M 
FETLKHR AEEFLE 
Q E E D 
NtADC1

CGAGCGATGCAGCACGAGCCCGAGCTCATGTTCGAGA 
CTCTCAAGCACCGTGCGGAGGAATTCTTGGAACAAGA 
AGAAGA 2395 
NtADC2



. . .T . . 2399 



630 ********** 
************ * 
* * p * 

656 KG LAIASLAS 
SL AQSFHNM P YL V 
A P A S 
NtADC1

CAAAGGGCTGGCCATTGCATCTTTGGCCAGCAGCTTA
GCTCAGTCCTTCCATAACATGCCTTACCTTGTGGCGC 
CTGCAT 2475 
NtADC2 

TG. . .A G. . 



. .T. . . 2479 

657 ****VE**** 
*y*********** 

* * S * 

683 CCFTAVTANN 
GG YNYY YSD ENAA 
D S A 

NtADC1

CTTGCTGCTTCACTGCAGTTACTGCTAACAACGGTGG 
CTATAACTACTATTACAGTGATGAGAATGCAGCAGAT 
TCTGCT 2555 
NtADC2 

C T.C A T 

T 

2559 

684 *. R * * * A * D * * 
************* 

* * *



709 TGEDEIWSYC 
T A *** 

NtADC1

ACAGGGGAGGATGAGATTTGGTCCTATTGCACTGCTT 
GAagtgttgtcgtagcatctccagttttagtttgtcg
tcgaag 2635 
NtADC2 



g. 2639 

710 *★★****'* 
* * ★ * *★* 

720 
NtADC1

ttgtctgtttttgaataatacccttagttggtgatgt 
ttttct 



T 



c 



2678 



NtADC2 



aataata 



2682 



[Figure residue: sequence alignment including rows labeled N. sylvestris, with position markers 391 and 733; the remaining labels and the aligned residues are not legible in the source image]






16 / 21 








17 / 21 






HindIII EcoRI kb





18/21 

-135 . 

ODC2 CCTACCCCTT CAACAGCTAT TTCTCTAAAA AAAAAAAARA AAOAAOAAA A TACTACGTAG ATTACACAAT ATTATCA CTA OTAOTATCAC HIILU ltlL 
OOC1 AAAGTGACTT TCACCCAATA TGACCCTCTC AAAHCTTTRA AAAAACAGTT TTTTTTTATT 1 M III I ATT TCCTCCAAAA AACACATTTT AAGGTATTTT 

-3 5 TATA box 4-1* * ♦ * « 

ODCJ TC tttttt»» AT GATAAACATT TTTAGACCTT TCCCCGTCTC A»rTO M r i AOA M 1A TAT TCATATTATT OARICCCTAG H1UUJL1 UtltlllU A 
OOCJ TAACCACATT I1U.1CCU1 U.1 T TULW. CTCCTATTCO ATTCCTTTAA CAATOOCTTC AACCATGGTO GCAACCTAXT CTCBTTTTCT TTCCCTTTGA 

66 "5 

ODC2 ' l l LCi r CC T i: TCATTTACCT L1C1H lilt ULLJ 1 mi I TGGATGGCCG GCCAAACAAT CATCCTTTCC C UUMUA ACC CCGCCCCCAT It J 1LACTOC 

OOC1 T IUCTTUnL TCATTTACCT CTATCTTTTC TTALJ I IU1 I TGGATGGTCG GCCAAACAAT CATCCTTTCC GOOTTOAACC CACCGCCCAT TCTTCAOTOC 

pODC2 KA CQTX ZV6 CLH P A A I L QS 

pOOCl V c 

166 2« 

ODC2 ACAATTCGCC GCCGA CC T TC TCCTACACCC GCGGCGGCGG CGGAAAA CG fl CACCAGAAAA GTCA TCCCTC TCTC AACAOA TGCCTTA CAA C ATTTCA TCT 
OOC1 ACAATTGCCC GCGGAGCTTC TCCTACACCC OCGGCCGCOG CGGAAAACGA C» CC *flA>M GTCATCCCTC TTTCAAGAGA TGCCCTACAA GATTTCATGT 
pODC2 TXG GGAS PTA AAA A E K G TfcK VXP LSAD ALQ DFM 
pODCl D 



266 



365 



ODC2 TATCAATCAT AACCCAAAAA TTACAAGATG AOAAACAACC TTTTTACGTG CTAGACTTGG GTGA GOTTCT TIL It 1 1 ATC GACCAATGGA AATCTGCTCT 

OOC1 TATCAATCAT AACTCAAAAA TTACAAGATG AGAAACAACC TTTTTACGTG CTAGATTTGG GTGAGGTTGT TTCTCTTATG GACCAATGGA AATATGCTCT 

POOC2 L S I I TQK LOO IKQP F T V L D L 6IVV SLH OQW KSAL 
pODCl - * 



366 



465 



ODC2 CCCAAATATC CGTCCATTTT ACGCTGTTAA ATGTAACCCT GAACCGTCGT TCCTTTCAAT TTTATCTGCT ATCG GCTC AA Al 1 1 IU ATTG TGCTAGCCGA 

OEC1 CCCAAATATC CGTCCATTTT ACGCTGTTAA ATGTAACCCT GAATCGTTGT TCCTTTCAAT TTTATCTGCT ATCGGCTCAA ATTTTGATTG TGCTAOCCGA 

PODC2 PHX RPF YAVK CMP EPS F L S I L6A MCS BIDC ASK 

pODCl S L 



466 



565 



ODC2 GCTGAAATTG AGTAT UI 1 1 1 ATCTCTTCGC ATTTCACCTO ACCOTATTOT TTTCCCAAAT CC A T OCRM C CGGAATCCOA TATTATTTTT CCAGCAAAAG 

OOC1 GCTGAAATTG AGTAT UI 1 1 1 ATCTCTTGGC AATAAOAAAA GAOAOAGOZC AATOGORAC TTUATTTOAT GAAAGTTTGG GAAATTAATA TTCGGGTTGT 

pOOCZ All STVL S I 0 ISP DRIV FAR F C H ' F I « D XIF A A K 
pOOCl MKKREASMGYLX. . 

566 «* 

ODC2 TTGGGGTGAA TCTTACAACC TATGATTCTG AAGA CC AGGT TTACAAGATC CGAAAGCATC ACCCGAAATC CCAACTCTTC CTCCGCATCA AGCCCATGCT 

ODC1 CTTCGTATCG TCATGGGAAT CTTTAOCTOA AGTTATAACA AATTTGGAGG AGTTTCTCTT AAAAATTTGG ATTAAAATCG TGCTTGGAAC A AG A* C A C A C 

PODC2 VGVII ITT TDS E D E V TKI A K H BPKS ELL LAX KPKL 



666 



765 



COC2 CGACGGCAAC GCGAGATGCC CAATGGGCCC GAAATACGGC GCGCTTCCAG AAGAA CTCGA CCCGC TGCTC CGGGr ACCTC AAGCCU.C CG TCTC ACCCTA 

ODC1 ATGAATAAAG CGAAGAACAC CAAGACCACT OATTTCCAAA ACACCAAATT TCAATTTTTT TAAACGTTTT CTTTCTTGGT TGGGTGTAAA TTAAGCTTTT 

pODC2 DOB ARC PHGP KTG ALP E E V D PLL AAA Q A A R LTV 

766 ~ «« 

OOC2 TCC GGC CTCT CATTCCACAT CGGTAGCGGA GATCCCCATT CAAACGCTTA Ttr UttJCCC ATAGCCGCCG CTAAGGAACT CTT TGAAACA GCTGCTAA AC 
ODC1 Lll 1 IH Hi TTAGAATGTT AT TTTT ATTT TATTTATTAA ATAGATTTAA CATAGTTTTT TTTACTCAAA ATAATATATG TCATTTTTTT ATTCQTCACT 
pGDC2 SCV SFHX G SG DAD SBAT LGA ZAA A KEV FET AAK 

"« - W 

QDC2 TCGGGATCTC GAAAATGACT GTTCTASACG TCGCCGGCCG GTTTACATCC GGCCACCAGT TCACAACCGC CGCCGTCGCC CTTAAATCAG C TTTAAAAC A 
ODC1 CCCCACGTCA GCASCGACTG CATTGCACAA ACTTTGTAAG TTTGOCTGAT TGTTAAATAA GTGCTCAATT GGAACAAAGT TCATBTAACC TATGTGCTCA 
pODC2 L G M S . KMT VLD VCCG FTS GHQ FTTA AVA VKS ALKQ 

966 "€S 

ODC2 ACACTTCGAT GACGAACCGG AGTTGACAAT CATAGCTGAA CCGGGTCGGT TTTTTGCAGA GACGGCGTTT ACTTTGGCAA CGACGATTAT AGGGAAAAGA 
ODC1 ATAGGAACTC TCTTAACTTT AGGTGTCTAA ATGAAAGATC GTGCCAACTT TAAGTCTCTC CGTATGTATT CAGCCAAAA T AATGTAAGCC AAATGTAGTC 

pODC2 bfo oip elti iae pgr r r a e TAF TLA TTXI GKR 

1066 "« 
OOC2 GTGAGGGGTG AATTGAGGGA GTATTGGATT AACGACGGGC TGTACGGTTC GATGAACTGT GTACTTTACG ACCA7GCGAC G GTGAATGCA ACGCCGTTAG 

ODC1 AATAAAGCGA TCGTGCTABA ACCACGGGAC TCAGGOAATG CCTTACACCT TCTCCCCGGT C AACAGAA T T CCTTACTCGG AGTTTGTTTT CGAAGACCAA 
pGDC2 VUG E L R S Y W X WDG L Y G S MUC VLT DHAT V K A TPL 

1166 ' »« 

COC2 CT G TTCTGTC GAATCGTAGT AACGTTACCT GCG GCGGG TC GAAAACGTTT C CG ACGA C TG TGTTTOOGCC CACTTCTCAT GCTCTTGATA CTGTTTTAAG 

ODC1 TAATAATAGA GTGAAACCTT CCTTTGAATA GGGATTCAAA AAAAAGGTGA CTTGGAACAC C AGCAAAAA T TAATTCCTAG TGGCGACACT GTAAATAAAA 

DODC2 AVLfi H R S WVT CGGfi KTF PTT VFGP TC D ALD T V L R 



1366 



13 €S 



ODC2 GGATTACCAG TTACCGGAGC TGCAGGTTAA TGATTGGCTG GTTTTTCCTA ATATGGGTGC TTATACTAAA CCTGCTGGGT CCAATTTTAA TGG ATTTAA T 
ODC1 TAATCCCTAT TTCAAATTTG TCACTTTAAT TGGAAAAACT CTTTCACCCA CAATCCATAA CAACACATTA TCTTTTGGAG GTOTAAAAAG GTGATCTGAC 
POOC2 D Y Q LPS LOVM D V L VFP RKGAYTX AAG SH FH GFW 

1366 ™* 

ODC2 ACTTCCGCCA TTGTTACTCA CCTCGCTTAT TCTTATCCAA GCTGATGAAC CACCTGTATT AGGAATTACT ACCGTGGTTT TGATGGTTTT Tlttll 11 1 T 
ODC1 AGCTCTAGCA ACTCTGCTGG GGGCTATTAA TAAGAATTCG AGCTTTGTAT ATTGATTTTT ATTTCGCTTT TATCATGTCT TCGATATTAT TGTGTTTGGG 
pODC2 TSA I V T B LAT SYP S. 

Poly A Blgrpal 1565 

, ODC? GGGTATCTTT TTTTTAATTT lUlJUUlll GGTAGTAATT TATATTCCAA ATCAGCTTGT AATTCTCTTC TATGCC K^^^g TGCAAGG Al 1IUL 1AAT 
ODC1 AGCATAATGT TCTATTTGTC TCTTATTTAT CGCTTTAATA GTTATTTAAA CTCTGATATA AATTGTATCC TATCTCCCAC CCCTCTGAGT CTTCTGATAG 

1S66 iPoly A site *« 5 

COC2 TGTGATTTTC TCTAATATGG AACTTTTTAA AATTAGTTTA AGAAACATAA TGGGTAAAAG GTTTGTGGGG TCATOATATT TGTCTGACTA TAAAAGCATC 
ODC1 GTAGTTATCT TGTCTTTGCC TACCAGCATC ATAATATTTC TGTCTTGAGA TAAAGCCAGT TAGCCTACCA GCTTTTGCTG AAGGATTTAA TCACATATGT 
[Figure residue: multiple sequence alignment, in blocks numbered 1 through 451, of sequences from N. tabacum cv Xanthi, N. tabacum cv BY2, N. tabacum cv SC58, D. stramonium, L. esculentum, S. cerevisiae and H. sapiens; the aligned residues are not legible in the source image]
<120> Regulation of Gene Expression in Tobacco for
Manipulation of Plant Growth and Secondary Metabolism

<130> 4981*239

<140>
<141>

<160> 26

<170> PatentIn Ver. 2.0

<210> 1
<211> 1120
<212> DNA
<213> Plant

<400> 1



[...] aagataatag aggtgggtgg 60
[...] tgcagaaaca attgaagggt 120
[...] gatgacttag cattaataaa 180
[...] ggcaagcctg gctcctggag 240
[...] aaagtaaagt tgtattattt 300
[...] tggtgtcctc tacaatgatt 360
[...] cctaggatca agcccctctt 420
[...] tgacatgaat gccaaaattc 480
[...] attaaatact atctggtgac 540
[...] tctactctat accaaccgaa 600
[...] cgtctgcctc cgatttcaca 660
[...] accctataca aatggtacat 720
[...] cgaggagtaa acagatgcga 780
[...] gagagaaatg aacacacaca 840
[...] acagagacta aattcatatt 900
[...] gtaatgattt tttaactcta 960
[...] gtatataaat gcatatgtgt 1020
[...] gtaatacttg ttgaaacata 1080
atactttctc ttctccaatt tgtttagttt aattttgaaa 1120



<210> 2 
<211> 3091 
<212> DNA 
<213> Plant 

<400> 2
ctgagttgac aagaacaatt cctggtgaat cagatggatg aagataatag aggtgggtgg 60
aatctataac caaagcagct ggttgagtga ctgtgcgagt tgcagaaaca attgaagggt 120 
catttgtgga atttggggcc atttcaaagg aaaaagaaaa gatgacttag cattaataaa 180 
tcaaattaaa ataaggctta gcgttaaaat caaaggaaat ggcaagcctg gctcctggag 240
caatgcttct gaggacagta gtaaaaacaa tatcagacaa aaagtaaagt tgtattattt 300 
agcttgagga taaagtatgt cattagtttt gtgagagatt tggtgtcctc tacaatgatt 360 
gttgaagtcc ctatttatag ctatacacag gaaacaaaat cctaggatca agcccctctt 420 
aaatgacaat aatggggtta atgatgaata tgtagcggca tgacatgaat gccaaaattc 480 
tccgcaacga ctatttattt aatattgagg aatatttttt attaaatact atctggtgac 540 
aagcattcgt ttgcttccgt tgattacgtt gattttggga tctactctat accaaccgaa 600 
gccgttgtcc ttgatcttcg ctttcattta attcatcttc cgtctgcctc cgatttcaca 660 
agtcatgcac ccattcaatt atttaatgga aaccaatttt accctataca aatggtacat 720
cattcgtcaa atactttact tggatataaa caattttgcc cgaggagtaa acagatgcga 780 
agaaagaaag cagacgatta aagaaatttt taaaaaagga gagagaaatg aacacacaca 840 
tgtactaata aaattagggt actactttac taataattgg acagagacta aattcatatt 900 
ttagttccaa aatgtctcgg gcagtccaac catgcacgtt gtaatgattt tttaactcta 960 
ttatatcgag ttgcgccctc cactcctcgg tgtccaaatt gtatataaat gcatatgtgt 1020 
ctattgggag tgtacatcaa gctttcataa agtacaaatc gtaatacttg ttgaaacata 1080 
atactttctc ttctccaatt tgtttagttt aattttgaaa atggaagtca tatctaccaa 1140 
cacaaatggc tctaccatct tcaagagtgg tgccattccc atgaatggcc accataatgg 1200 
cacttccaaa caccaaaacg gccacaagaa tgggacttcc gaacaacaga acgggacaat 1260 
cagccttgat aatggcaacg agctactggg aaactccaat tgtattaagc ctggttggtt 1320 
ttcagagttt agcgcattat ggccaggtta gtactgagaa agaaactcaa atgcatattt 1380 
aaagttaaaa ttgttaggct aatataagga gttgatattc ttttagtgat taattaaaaa 1440 
ggaaaaagta tcaaataaat tcaaaaaatg gatagtaact tcgcatatta ctctacacat 1500
taatttgaaa taaatcgaat tttgcaggtg aagcattctc acttaaggtt gagaagttac 1560 
tgttccaggg gaagtctgac taccaagatg tcatgctctt tgaggtaaat aatattttaa 1620 
tacacatgct tccatttaaa ttgatacttt taatttactt ttactttatt gcatgtgtac 1680 
gtacagtcag caacttatgg gaaggttctg actttggatg gagcaattca acacacagag 1740 
aatggtggat ttccatacac tgaaatgatt gttcatcttc cacttggttc catcccaaac 1800 
ccaaaaaagg ttttgatcat cggcggagga attggtttta cattattcga aatgcttcgt 1860
tatcctacaa tcgaaaaaat tgacattgtt gagatcgatg acgtggtagt tgatgtaagt 1920
caaacttctt ttactcacat aaaaaaatgg tttagattgc ttcttgttat ttttctaaaa 1980 
gaatactatt tttttaaaac aaaattttct tttttacagg tatctagaaa atttttccct 2040 
tatctcgctg ctaattttaa cgatcctcgt gtaaccctag tccttggaga tggtgcgtat 2100 
ttgataatct cgcttttgtt ttatctttta tttttattgc atttaatttt taccttttgg 2160 
tgtgtggtta attcacctgc cattggttct ctttcatttc aggggctgca tttgtaaagg 2220 
ctgcacaagc agaatattat gatgctatta tagtggactc ttctgatccc attggtactc 2280 
tattacttct taataccaag actaatctta ttgaataagc tactaataaa cggtaattga 2340 
tttctaaaac aatataattt caggtccagc aaaagatttg tttgagaggc cattctttga 2400 
ggcagtagct aaagccctaa ggccaggagg agttgtatgc acacaggctg aaagcatttg 2460 
gcttcatatg catattatta agcaaatcat tgctaactgt cgtcaagtct ttaagggctc 2520 
tgtcaactat gcttggacta ctgttccaac atatccaacg tatttttctc tctctctctc 2580 
ttcctataaa attggaagtt ttgattctat aattgtcaag aaatggagaa tcagttccaa 2640 
gaaaaaccaa cttcttttct tttactcttc aaggtattgt gtttaatttt ttttcaactg 2700
atatgatcaa ttattttgat ttcagcggtg tgattggtta tatgctctgc tctactgaag 2760 
gaccagaaat tgacttcaag aatccagtaa atccaattga caaagagaca gctcaagtca 2820 
agtccaaatt agcacctctc aagttctaca actctgatgt aacttcatat ctcacaattt 2880
cttttttcct attgtacttt atgttcttcg tcaaatttta taattaactc ttttcaaatt 2940
gtcttttttt ttttcagatt cacaaagcag cattcatttt gccatctttc gccagaagta 3000
tgatcgagtc ttaatcaact gattaatgaa tactggtggt acaatcattg gaccaagatc 3060
aataagtgaa agacgtattg tatgagaatt c 3091

<210> 3 
<211> 353 
<212> PRT 
<213> Plant 

<400> 3 

Met Glu Val Ile Ser Thr Asn Thr Asn Gly Ser Thr Ile Phe Lys Ser
1 5 10 15

Gly Ala Ile Pro Met Asn Gly His His Asn Gly Thr Ser Lys His Gln
20 25 30

Asn Gly His Lys Asn Gly Thr Ser Glu Gln Gln Asn Gly Thr Ile Ser
35 40 45

Leu Asp Asn Gly Asn Glu Leu Leu Gly Asn Ser Asn Cys Ile Lys Pro
50 55 60

Gly Trp Phe Ser Glu Phe Ser Ala Leu Trp Pro Gly Glu Ala Phe Ser
65 70 75 80

Leu Lys Val Glu Lys Leu Leu Phe Gln Gly Lys Ser Asp Tyr Gln Asp
85 90 95

Val Met Leu Phe Glu Ser Ala Thr Tyr Gly Lys Val Leu Thr Leu Asp
100 105 110

Gly Ala Ile Gln His Thr Glu Asn Gly Gly Phe Pro Tyr Thr Glu Met
115 120 125

Ile Val His Leu Pro Leu Gly Ser Ile Pro Asn Pro Lys Lys Val Leu
130 135 140

Ile Ile Gly Gly Gly Ile Gly Phe Thr Leu Phe Glu Met Leu Arg Tyr
145 150 155 160

Pro Thr Ile Glu Lys Ile Asp Ile Val Glu Ile Asp Asp Val Val Val
165 170 175

Asp Val Ser Arg Lys Phe Phe Pro Tyr Leu Ala Ala Asn Phe Asn Asp
180 185 190

Pro Arg Val Thr Leu Val Leu Gly Asp Gly Ala Ala Phe Val Lys Ala
195 200 205

Ala Gln Ala Glu Tyr Tyr Asp Ala Ile Ile Val Asp Ser Ser Asp Pro
210 215 220

Ile Gly Pro Ala Lys Asp Leu Phe Glu Arg Pro Phe Phe Glu Ala Val
225 230 235 240

Ala Lys Ala Leu Arg Pro Gly Gly Val Val Cys Thr Gln Ala Glu Ser
245 250 255

Ile Trp Leu His Met His Ile Ile Lys Gln Ile Ile Ala Asn Cys Arg
260 265 270

Gln Val Phe Lys Gly Ser Val Asn Tyr Ala Trp Thr Thr Val Pro Thr
275 280 285

Tyr Pro Thr Gly Val Ile Gly Tyr Met Leu Cys Ser Thr Glu Gly Pro
290 295 300

Glu Ile Asp Phe Lys Asn Pro Val Asn Pro Ile Asp Lys Glu Thr Ala
305 310 315 320

Gln Val Lys Ser Lys Leu Ala Pro Leu Lys Phe Tyr Asn Ser Asp Ile
325 330 335

His Lys Ala Ala Phe Ile Leu Pro Ser Phe Ala Arg Ser Met Ile Glu
340 345 350

Ser



<210> 4 
<211> 711 
<212> DNA 
<213> Plant 

<400> 4 

gaattcaatg gagaaggaaa atatttccag tgtaaacaca agtgaatgaa gagaagccaa 60
aataatctct atcattcaag ccttaggtgg agattaaaaa aattatttac tttcttatca 120
aagtaatagg tgatcaacag ctttcgtaaa acgtcattag gagaatatta taatctcttt 180
tatgctgaag aacccacata aggaagatca taaaatacat gactttcaga tgacttcttg 240
gagctttatt tttaaagagt ggctagctgg tcagcaaaga ggtgctcgtc agatatcata 300
aaattttact attatttgtt ttaagaggga gatggggcac acatgcttgt gacaaaagta 360
agaggaagaa aggagacaga agaggaaata gatttggggg gggggggggg ggtttcacaa 420
tcaaagaaaa tttttaaaat ggagagagaa atgagcacac acatatacta acaaaatttt 480
actaataatt gcaccgagac aaacttatat tttagttcca aaatgtcagt ctaaccctgc 540
acgttgtaat gaatttttaa ctattatatt atatcgagtt gcgccctcca ctcctcggtg 600
tccaaattgt atttaaatgc atagatgttt attgggagtg tacagcaagc tttcggaaaa 660 
tacaaaccat aatactttct cttcttcaat ttgtttagtt taattttgaa a 711 

<210> 5 
<211> 3129 
<212> DNA 
<213> Plant 

<400> 5 

gaattcaatg gagaaggaaa atatttccag tgtaaacaca agtgaatgaa gagaagccaa 60 
aataatctct atcattcaag ccttaggtgg agattaaaaa aattatttac tttcttatca 120 
aagtaatagg tgatcaacag ctttcgtaaa acgtcattag gagaatatta taatctcttt 180 
tatgctgaag aacccacata aggaagatca taaaatacat gactttcaga tgacttcttg 240 
gagctttatt tttaaagagt ggctagctgg tcagcaaaga ggtgctcgtc agatatcata 300
aaattttact attatttgtt ttaagaggga gatggggcac acatgcttgt gacaaaagta 360 
agaggaagaa aggagacaga agaggaaata gatttggggg gggggggggg ggtttcacaa 420 
tcaaagaaaa tttttaaaat ggagagagaa atgagcacac acatatacta acaaaatttt 480 
actaataatt gcaccgagac aaacttatat tttagttcca aaatgtcagt ctaaccctgc 540 
acgttgtaat gaatttttaa ctattatatt atatcgagtt gcgccctcca ctcctcggtg 600 
tccaaattgt atttaaatgc atagatgttt attgggagtg tacagcaagc tttcggaaaa 660 
tacaaaccat aatactttct cttcttcaat ttgtttagtt taattttgaa aatggaagtc 720 
atatctacca acacaaatgg ctctaccatc ttcaagaatg gtgccattcc catgaacggc 780 
caccaaaatg gcacttctga acacctcaac ggctaccaga atggcacttc caaacaccaa 840 
aacgggcacc agaatggcac tttcgaacat cggaacggcc accagaatgg gacatccgaa 900 
caacagaacg ggacaatcag ccatgacaat ggcaacgagc tactgggaag ctccgactct 960 
attaagcctg gctggttttc agagtttagc gcattatggc caggttagta ctaagaaagc 1020
aactcaaatg catcggcctc ttgttgctac taaatataga gagctatcat acttttaggg 1080 
actaactaaa aaggaaagat tatcacaggg acgaagtgag cagttaactt cgcatattat 1140 
cagacgcatt aatttgaaat aatcgaattt tgcaggtgaa gcattctcac ttaaggttga 1200 
gaagttacta ttccagggga agtctgatta ccaagatgtc atgctctttg aggtaattaa 1260 
tattctaata cacatgcttt aatttaaagt gatactttta atttactttt agtttattgc 1320
atgtgcacgt acagtcagca acttatggga aggttctgac tttggatgga gcaattcaac 1380
atacagagaa tggtggattt ccatacactg aaatgattgt tcatctacca cttggttcca 1440 
tcccaaaccc aaaaaaggtt ttgatcatcg gcggaggaat tggttttaca ttattcgaaa 1500 
tgcttcgtta tccttcaatc gaaaaaattg acattgttga gatcgatgac gtggtagttg 1560 
atgtaagtca aacttctttt acccacataa agaaaatgat ttagattgca attcttttta 1620 
tttttctaaa agaataaata tattctcttt ttttttttta aaacaaaatt ctctttctta 1680 
caggtatcca gaaaattttt cccttatctg gcagctaatt ttaacgatcc tcgtgtaacc 1740 
ctagttctcg gagatggtgc gtatatgata gtctcgtttt atattttatt tcacttgatt 1800 
tttacctttt tttgtggtta attaatcatc taccattggt tctctttacc ttcaggagct 1860 
gcatttgtaa aggctgcaca agcgggatat tatgatgcta ttatagtgga ctcttctgat 1920 
cccattggta cgctattact atttaatacc aagactattc ttattaaata agctactaag 1980 
aaactaattg aataattaat aaacgtaact gtaattgatt tctaaaataa tatatataat 2040 
ttcaggtcca gcaaaagatt tgtttgagag gccattcttt gaggcagtag ccaaagccct 2100 
taggccagga ggagttgtat gcacacaggc tgaaagcatt tggcttcata tgcatattat 2160 
taagcaaatc attgctaact gtcgtcaagt ctttaagggt tctgtcaact atgcttggac 2220
aaccgttcca acatatccca cgtattcttt ttctctctct ctcttcctgt ctttttcgat 2280
gcaatgtaaa tttataaaat tggaagtccg ttttactttt ctatagacgt agatcctaaa 2340
attgtcaaga aatggagaat tgacttacaa gaaaaatcaa cttcttttca tttactattc 2400 
tttttggtga caaactttac ttattatttc gttctaaaat gaaaatttat ttttatattt 2460 
taaaataatt tagctttaaa cttttaattt tacttgttat atttttaata aaaaagattt 2520 
atagtcaaat aaatgttgtg accatataaa aacctccgca tttttaagat cataagtttc 2580 
agagtcaaac gagttaattt atttttagta tgccggtgcg gagtcaaatt atgtcataaa 2640 
aattgaaacg gagtgagaac atttttattt cgagtaaact ttcaaggtat tgtgtttaat 2700 
ttcaagtgat actgatcaat gatgtcttaa atattttgat ttcagcggtg tgatcggtta 2760 
tatgctctgc tctactgaag ggccagaagt tgacttcaag aatccagtaa atccaattga 2820 
caaagagaca actcaagtca agtccaaatt aggacctctc aagttctaca actctgatgt 2880 
aacttcatat ctcacaattt ctttttccgt tttactgtat gttcttcgtc aaattttata 2940 
actaactctt ttcatattgt cttttttttc agattcacaa agcagcattc attttaccat 3000 
ctttcgccag aagtatgatc gagtcttaat caagtgaata atgaacactg gtagtacaat 3060 
cattggacca agatcgagtc ttaatcaagt gaataaataa gtgaaatgcg acgtattgta 3120
ggagaattc 3129

<210> 6
<211> 375
<212> PRT
<213> Plant

<400> 6

Met Glu Val Ile Ser Thr Asn Thr Asn Gly Ser Thr Ile Phe Lys Asn
1 5 10 15

Gly Ala Ile Pro Met Asn Gly His Gln Asn Gly Thr Ser Glu His Leu
20 25 30

Asn Gly Tyr Gln Asn Gly Thr Ser Lys His Gln Asn Gly His Gln Asn
35 40 45

Gly Thr Phe Glu His Arg Asn Gly His Gln Asn Gly Thr Ser Glu Gln
50 55 60

Gln Asn Gly Thr Ile Ser His Asp Asn Gly Asn Glu Leu Leu Gly Ser
65 70 75 80

Ser Asp Ser Ile Lys Pro Gly Trp Phe Ser Glu Phe Ser Ala Leu Trp
85 90 95

Pro Gly Glu Ala Phe Ser Leu Lys Val Glu Lys Leu Leu Phe Gln Gly
100 105 110

Lys Ser Asp Tyr Gln Asp Val Met Leu Phe Glu Ser Ala Thr Tyr Gly
115 120 125

Lys Val Leu Thr Leu Asp Gly Ala Ile Gln His Thr Glu Asn Gly Gly
130 135 140

Phe Pro Tyr Thr Glu Met Ile Val His Leu Pro Leu Gly Ser Ile Pro
145 150 155 160

Asn Pro Lys Lys Val Leu Ile Ile Gly Gly Gly Ile Gly Phe Thr Leu
165 170 175

Phe Glu Met Leu Arg Tyr Pro Ser Ile Glu Lys Ile Asp Ile Val Glu
180 185 190

Ile Asp Asp Val Val Val Asp Val Ser Arg Lys Phe Phe Pro Tyr Leu
195 200 205

Ala Ala Asn Phe Asn Asp Pro Arg Val Thr Leu Val Leu Gly Asp Gly
210 215 220

Ala Ala Phe Val Lys Ala Ala Gln Ala Gly Tyr Tyr Asp Ala Ile Ile
225 230 235 240

Val Asp Ser Ser Asp Pro Ile Gly Pro Ala Lys Asp Leu Phe Glu Arg
245 250 255

Pro Phe Phe Glu Ala Val Ala Lys Ala Leu Arg Pro Gly Gly Val Val
260 265 270

Cys Thr Gln Ala Glu Ser Ile Trp Leu His Met His Ile Ile Lys Gln
275 280 285

Ile Ile Ala Asn Cys Arg Gln Val Phe Lys Gly Ser Val Asn Tyr Ala
290 295 300

Trp Thr Thr Val Pro Thr Tyr Pro Thr Gly Val Ile Gly Tyr Met Leu
305 310 315 320

Cys Ser Thr Glu Gly Pro Glu Val Asp Phe Lys Asn Pro Val Asn Pro
325 330 335

Ile Asp Lys Glu Thr Thr Gln Val Lys Ser Lys Leu Gly Pro Leu Lys
340 345 350

Phe Tyr Asn Ser Asp Ile His Lys Ala Ala Phe Ile Leu Pro Ser Phe
355 360 365

Ala Arg Ser Met Ile Glu Ser
370 375



<210> 7 
<211> 1134
<212> DNA
<213> Plant

<400> 7
gctgtacaaa aggatgtctc aaatcatttg gaatattaat tctgcaatca acaagaaata 60
ccccactatt aagacccatt atcactggca caaaaattat gagatcatta aacatcttaa 120
acctgtccct atttggaaga gtgtggtatg ggagatgcct cccagggagt acctaaagct 180
gaatactgat ggaagtttta acaaacaaat tgggaaagca gggattggag ggattctcag 240
agatgaagag ggaggctttg tcatggcttt ttcgatgcct ataatctata ataacatcag 300
tgaagcagaa ttgaaagcca tcaagtatgg gtgtgaatgg tgcaaataca aaggaatatc 360
aaacttcatt gtggaaactg actcgaggat gatctatgac atactacaga ccaaaaatct 420
aagcaacaac aagttgaaac aagagaccga gaaattaatg gagattctgg acacctgcag 480
gacacctgtt acccattgcc ttcgcgaagc aaatcaagtg gcagactggt ttgctaaaga 540
ggccaccaga gctaacgaag gtatcactca tacagatttt agacaggtat caaaagcggc 600
caagggccct ttcttcatgg atatgtggca ggtcccttat tttagaatta gatatgaaaa 660
atctaatttt tttttgtaag ttaattctgt gtatagtgag aggaaatcgt ctaatatgta 720
tttttgccca tagactcttc ctctccttag gtaaaaaggt agctccgagg taaggtttat 780
gttcccctca gtgtaacctt tttttgttta tataatagac atggtatggg tccagctaaa 840
cccccaacac cacaggggat agatacctgg gtgattggtt tattttttaa aaaaaaaaac 900
tttactaata attgcacgga gacaaaactt atattttagt tccaaaatga cagtccaacc 960
atgcacgttg taatgatttt ttaactctat tatatcgagt tccgccctcc actcctcggt 1020
gtccaaattg tatttaaatg catagatatg tttattggga gtgtacatca agctttcaga 1080
aaatacaaac cataatactt tctcttctcc aatttgctta gtttaatttg gaaa 1134



<210> 8 
<211> 3269 
<212> DNA 
<213> Plant 



<400> 8 

gctgtacaaa aggatgtctc aaatcatttg gaatattaat tctgcaatca acaagaaata 60
ccccactatt aagacccatt atcactggca caaaaattat gagatcatta aacatcttaa 120
acctgtccct atttggaaga gtgtggtatg ggagatgcct cccagggagt acctaaagct 180
gaatactgat ggaagtttta acaaacaaat tgggaaagca gggattggag ggattctcag 240
agatgaagag ggaggctttg tcatggcttt ttcgatgcct ataatctata ataacatcag 300
tgaagcagaa ttgaaagcca tcaagtatgg gtgtgaatgg tgcaaataca aaggaatatc 360
aaacttcatt gtggaaactg actcgaggat gatctatgac atactacaga ccaaaaatct 420
aagcaacaac aagttgaaac aagagaccga gaaattaatg gagattctgg acacctgcag 480
gacacctgtt acccattgcc ttcgcgaagc aaatcaagtg gcagactggt ttgctaaaga 540
ggccaccaga gctaacgaag gtatcactca tacagatttt agacaggtat caaaagcggc 600
caagggccct ttcttcatgg atatgtggca ggtcccttat tttagaatta gatatgaaaa 660
atctaatttt tttttgtaag ttaattctgt gtatagtgag aggaaatcgt ctaatatgta 720
tttttgccca tagactcttc ctctccttag gtaaaaaggt agctccgagg taaggtttat 780
gttcccctca gtgtaacctt tttttgttta tataatagac atggtatggg tccagctaaa 840
cccccaacac cacaggggat agatacctgg gtgattggtt tattttttaa aaaaaaaaac 900
tttactaata attgcacgga gacaaaactt atattttagt tccaaaatga cagtccaacc 960
atgcacgttg taatgatttt ttaactctat tatatcgagt tccgccctcc actcctcggt 1020
gtccaaattg tatttaaatg catagatatg
aaatacaaac cataatactt tctcttctcc 
gtcatatcta ccaacacaaa tggctctact 
ggttaccaga atggcacttc caaacaccaa 
cggaacggcc accagaatgg gatttccgaa 
gagcatcaga acggccatca gaatgggaca 
ctactgggaa gctccaactc tattaagcct 
ccaggttagt actaagaaag aaactcaaat 
taatttagat gatggtgttt gactaagcac 
attgttacta tagagagcta tatctttagg 
aaattgggat gaagtaagca gttaacttcg 
atcgaatttt gcaggtgaag cattctcact 
gtctgattac caagatgtca tgctctttga 
atgtatgatt atatttaaag tggtattttt 
tacagtcagc aacatatggg aaggttctga 
atggtggatt tccatacact gaaatgattg 
ctaaaaaggt tttgatcatc ggcggaggaa 
atcctacaat cgaaaaaatt gacattgttg 
aaacttcttt tactcacata aaaaaatgat 
acaaaatttt ccgttttaca ggtatctaga 
agcgatcctc gtgtaaccct agtccttgga 
ttttatcttt tacttttatt ttatttaatt 
gccattggtt ctttttattt caggggctgc 
tgatgctatt atagtggact cttctgatcc 
gactattctt attaaataag ctactaataa 
taatttcagg tccagcaaaa gacttgtttg 
ccctaaggcc aggaggagtt gtatgcacac 
ttattaagca aatcattgct aactgtcgtc 
ggactactgt tccaacatat ccaacgtatt 
ttttgattct ataattgtca agaaatggag 
cttttactct tcaaggtgtg tttaagtttt 
tcagcggtgt gattggttat atgctctgtt 
atccagtaaa tccaattgac aaagagacaa 
agttctacaa ctctgatgta acttcatatc 
ttcttagtca aattttataa ttaactcttt 
agcagcattc attttgccat ctttcgccag 
atgaatactg gcggtacaat cattggacca 
gtgaaatgcg acgtattgta taagaattc 

<210> 9 
<211> 381 
<212> PRT 
<213> Plant 



tttattggga gtgtacatca agctttcaga 1080 
aatttgctta gtttaatttg gaaaatggaa 1140 
atcttcaaga atggtgccat tcccatgaac 1200 
aacggccacc agaatggcac ttccgaacat 1260 
caccaaaacg gccaccagaa tggcacttcc 1320 
atcagccatg acaacggcaa cgagctacag 1380 
ggttggtttt cagagtttag cgcattatgg 1440 
gcatcgtact cttgtattct gctttgcgta 1500 
tgagtttaaa aataaaaagt ttaaagttaa 1560 
aactaactaa aaaggaaaaa ttatcacata 1620 
catattattc gacacattaa tttgaaataa 1680 
taaggttgag aagttactat tccaggggaa 1740 
ggtaattaat taatactaat agtcaagctc 1800 
cgtttatttt taatttattg cacgtgtacg 1860 
ctttggatgg agcaattcaa cacacagaga 1920 
ttcatcttcc acttggttcc atcccaaacc 1980 
ttggttttac attattcgaa atgcttcgtt 2040 
agatcgatga cgtggtagtt gatgtaagtc 2100 
ttagattctt atttttctaa aagaattaaa 2160 
aaatttttcc cttatcttgc tgctaatttt 2220 
gatggtgcgt atttgataat ctcgttttta 22 8 0 
tttacctttt tgtgtgtggt taattcacct 234 0 
atttgtaaag gccgcacaag caggatatta 2400 
cattggtact ctattactac ttaataccaa 2460 
acgtaactct gatagttttc taaaataata 2520 
agaggccatt ctttgaggca gtagccaaag 2580 
aggctgaaag catttggctt catatgcata 2640 
aagtctttaa gggctctgtc aactatgctt 2700 
tttctctctc tcttcctata aaattggaag 27 60 
aatcagttcc aagaaaaacc aaattctttt 2 820 
ttaaactgat actgatcaat tattttgatt 2880 
ctactgaagg accagaagtt gacttcaaga 2940 
ctcaagtcaa gtccaaatta gcacctctca 3000 
tcaatttctt ttttcttatt gtactttatg 3060 
tcaaattgtc tttttttttc agattcacaa 3120 
aagtatgatc gagtcttaat caagtgacta 3180 
agatcgagtc ttaatcaagt gaataaataa 3240 

3269 



<400> 9 

Met Glu Val Ile Ser Thr Asn Thr Asn Gly Ser Thr Ile Phe Lys Asn
1 5 10 15

Gly Ala Ile Pro Met Asn Gly Tyr Gln Asn Gly Thr Ser Lys His Gln
20 25 30

Asn Gly His Gln Asn Gly Thr Ser Glu His Arg Asn Gly His Gln Asn
35 40 45

Gly Ile Ser Glu His Gln Asn Gly His Gln Asn Gly Thr Ser Glu His
50 55 60

Gln Asn Gly His Gln Asn Gly Thr Ile Ser His Asp Asn Gly Asn Glu
65 70 75 80

Leu Gln Leu Leu Gly Ser Ser Asn Ser Ile Lys Pro Gly Trp Phe Ser
85 90 95

Glu Phe Ser Ala Leu Trp Pro Gly Glu Ala Phe Ser Leu Lys Val Glu
100 105 110

Lys Leu Leu Phe Gln Gly Lys Ser Asp Tyr Gln Asp Val Met Leu Phe
115 120 125

Glu Ser Ala Thr Tyr Gly Lys Val Leu Thr Leu Asp Gly Ala Ile Gln
130 135 140

His Thr Glu Asn Gly Gly Phe Pro Tyr Thr Glu Met Ile Val His Leu
145 150 155 160

Pro Leu Gly Ser Ile Pro Asn Pro Lys Lys Val Leu Ile Ile Gly Gly
165 170 175

Gly Ile Gly Phe Thr Leu Phe Glu Met Leu Arg Tyr Pro Thr Ile Glu
180 185 190

Lys Ile Asp Ile Val Glu Ile Asp Asp Val Val Val Asp Val Ser Arg
195 200 205

Lys Phe Phe Pro Tyr Leu Ala Ala Asn Phe Ser Asp Pro Arg Val Thr
210 215 220

Leu Val Leu Gly Asp Gly Ala Ala Phe Val Lys Ala Ala Gln Ala Gly
225 230 235 240

Tyr Tyr Asp Ala Ile Ile Val Asp Ser Ser Asp Pro Ile Gly Pro Ala
245 250 255

Lys Asp Leu Phe Glu Arg Pro Phe Phe Glu Ala Val Ala Lys Ala Leu
260 265 270

Arg Pro Gly Gly Val Val Cys Thr Gln Ala Glu Ser Ile Trp Leu His
275 280 285

Met His Ile Ile Lys Gln Ile Ile Ala Asn Cys Arg Gln Val Phe Lys
290 295 300

Gly Ser Val Asn Tyr Ala Trp Thr Thr Val Pro Thr Tyr Pro Thr Gly
305 310 315 320

Val Ile Gly Tyr Met Leu Cys Ser Thr Glu Gly Pro Glu Val Asp Phe
325 330 335

Lys Asn Pro Val Asn Pro Ile Asp Lys Glu Thr Thr Gln Val Lys Ser
340 345 350

Lys Leu Ala Pro Leu Lys Phe Tyr Asn Ser Asp Ile His Lys Ala Ala
355 360 365

Phe Ile Leu Pro Ser Phe Ala Arg Ser Met Ile Glu Ser
370 375 380

<210> 10
<211> 469
<212> DNA
<213> Plant 

<400> 10 

gtcgacctct gattccacaa gtcatgcacc cattcaatta tttaatggaa accaatttta 60
ccctgtacaa atggtacaaa tactttcctt ggataaaaac aattttgcct aaggagtaaa 120
cagatgcgaa gtaagaaagc agacgactaa agaaaatttt aaaaaaggag agagaaatga 180
gcacacacac gtactaataa aattagggta ctactttact aataattgga cagagactaa 240
attcatattt tagttccaaa atgtctcggg cagtccaacc atgcacgttg taatgagttt 300
ttaactctat tatctcgagt tgcgccctcc actcctctgt gtccaagttg tatataaatg 360
catatatgtc tattgggagt gtacagcgag ctttcataaa gtacaaatca taatacttgt 420
tgaaacataa tactttctct tctccaattt gtttagttta attttgaaa 469

<210> 11
<211> 3001
<212> DNA
<213> Plant

<400> 11

gtcgacctct gattccacaa gtcatgcacc cattcaatta tttaatggaa accaatttta 60
ccctgtacaa atggtacaaa tactttcctt ggataaaaac aattttgcct aaggagtaaa 120
cagatgcgaa gtaagaaagc agacgactaa agaaaatttt aaaaaaggag agagaaatga 180
gcacacacac gtactaataa aattagggta ctactttact aataattgga cagagactaa 240
attcatattt tagttccaaa atgtctcggg cagtccaacc atgcacgttg taatgagttt 300
ttaactctat tatctcgagt tgcgccctcc actcctctgt gtccaagttg tatataaatg 360
catatatgtc tattgggagt gtacagcgag ctttcataaa gtacaaatca taatacttgt 420 
tgaaacataa tactttctct tctccaattt gtttagttta attttgaaaa tggaagtcat 480 
atctaccaac acaaatggct cgaccatctt caagaatggt gccattccca tgaatggcca 540
ccagagtggc acttccaaac acctcaacgg ctaccagaac ggcacttcca aacaccaaaa 600 
cggccaccat aatggcactt ccgaacatcg gaacggccac cagaatggga tttccgaaca 660 
ccaaaacggc caccagaatg ggacttccga acatcggaac ggccaccaga atgggatttc 720 
cgaacaccaa aacggccacc agaatgggac ttccgaacac caaaacggcc accagaatgg 780
gacttccgaa caacagaacg ggacaatcag ccatgacaat ggcaacgagc tactgggaaa 840 
ctccaactct attaagcttg gttggttttc agagtttagc gcattatggc caggttagta 900 
ctgagaaaga aactcaaatt catatttaaa gttaaaattg ttaggctaat ataagaagtt 960 
gattttcttt tagtgattaa ttaaaaaagg aaagagtatc aaataaattc caaaaaatga 1020 
ccagtaactt cgcatattat tctacacatt aatttgaaat aaatcgaatt ttgcaggtga 1080 
agcattctcc cttaaggttg agaagttact atttcagggg aagtctgact accaagatgt 1140 
catgctcttt gaggtaaata atattctaat acacatgctt taatatgaat aaatactttt 1200 
aatttacttt tagtttattg cacgtgtacg tacagtcagc aacatatggg aaggttttga 1260 
ctttggatgg agcaattcaa cacacagaga atggtggatt tccatacact gaaatgattg 1320 
ttcatcttcc acttggttcc atcccaaacc caaaaaaggt tttgatcatc ggcggaggaa 1380 
ttggttttac attattcgaa atgcttcgtt atcctacaat cgaaaaaatt gacattgttg 1440 
aaatcgatga cgtggtagtt gatgtaagtc aaatttcttt tactcacata aaaaaatgat 1500 
ttagattgct tctttttatt tttctaaaag aataaatata ttctctctta gttttaaaca 1560 
aaattctctt tcttacaggt atctagaaaa tctttccctt atctcgcagc taattttaat 1620 
gatcctcgtg taaccctcgt tctcggagat ggtgcgtatt tataatctcg tttttgtttt 1680 
atcttttatt tttatttcat ttaatttacc tttttgtgtg tggttaattt acccgtcatt 1740 
ggttctcttt catttcaggg gctgcatttg taaaggctgc acaagcagga tattatgatg 1800 
ctattatagt ggactcttct gatcccattg gtactctatt actacttaat accaagacta 1860 
atcttattga ataagctact aataaactgt aattgatttc taaaataata taatttcagg 1920 
tccagcaaaa gatttgtttg agaggccatt ctttgaggca gtagccaaag ccctaaggcc 1980
aggaggagtt gtatgcacac aggccgaaag catttggctt catatgcata ttattaagca 2040 
aatcattgct aactgtcgtc aagtctttaa gggctctgtc aactacgctt ggactactgt 2100 
tccaacatat cccacgtatt ttctctctct ctctcttcat ctttgaaaat tgaaaatcct 2160 
gactactttc cttcctttga ttcctcggtt aaaggggcgt agatcataag attttcaaga 2220 
aatagataat gacgtccaag aaaaactaac ttcttttcat ttactattct ttttggtgac 2280
aaactttatt tattatttcg ttctaaagag aaaatttatt tttatatttt aaaataattt 2340 
tgttttaaac ttttattttt acttattata tctttaataa aaaaattata gtcaaataaa 2400 
tattatggcc acactaaaca tccaagtttt tgaaaccata agttttagag ccaaatgagt 2460
taatttgttt ttggtatgcg ggtgcggagt caaattatgt cacaaaaatt gtaatggagt 2520 
gagcaaattt ttatttcgag taaactttca aggtattgtg ttaaagtttt ttcaactgat 2580 
actaatcaat tatgtctcaa ccattttgat ttcagtggtg taattgggta tatgctctgc 2640 
tctactgaag ggccagaagt tgacttcaag aatccaataa atccaattga caaagagaca 2700
actcaagtca agtccaaatt agcacctctc aagttttaca attctgatgt aacttcatat 2760
ctaacaattt ctttttctgt tttactgtat cttcattgtc aaaattttat aattaactct 2820 
tctcaaatta tctttttttt tagattcaca aagcagcatt cattttgcca tctttcgcca 2880 
gaagtatgat cgagtcttaa tcaagtgaat aatgaacact ggtggtgcaa tcattggacc 2940 
aagatcgagt cttaatcaag tgaataaata agtgaaatgc cgacgtattg tatgagaatt 3000 
c 3001

<210> 12
<211> 419
<212> PRT
<213> Plant

<400> 12

Met Glu Val Ile Ser Thr Asn Thr Asn Gly Ser Thr Ile Phe Lys Asn
1 5 10 15

Gly Ala Ile Pro Met Asn Gly His Gln Ser Gly Thr Ser Lys His Leu
20 25 30

Asn Gly Tyr Gln Asn Gly Thr Ser Lys His Gln Asn Gly His His Asn
35 40 45

Gly Thr Ser Glu His Arg Asn Gly His Gln Asn Gly Ile Ser Glu His
50 55 60

Gln Asn Gly His Gln Asn Gly Thr Ser Glu His Arg Asn Gly His Gln
65 70 75 80

Asn Gly Ile Ser Glu His Gln Asn Gly His Gln Asn Gly Thr Ser Glu
85 90 95

His Gln Asn Gly His Gln Asn Gly Thr Ser Glu Gln Gln Asn Gly Thr
100 105 110

Ile Ser His Asp Asn Gly Asn Glu Leu Leu Gly Asn Ser Asn Ser Ile
115 120 125

Lys Leu Gly Trp Phe Ser Glu Phe Ser Ala Leu Trp Pro Gly Glu Ala
130 135 140

Phe Ser Leu Lys Val Glu Lys Leu Leu Phe Gln Gly Lys Ser Asp Tyr
145 150 155 160

Gln Asp Val Met Leu Phe Glu Ser Ala Thr Tyr Gly Lys Val Leu Thr
165 170 175

Leu Asp Gly Ala Ile Gln His Thr Glu Asn Gly Gly Phe Pro Tyr Thr
180 185 190

Glu Met Ile Val His Leu Pro Leu Gly Ser Ile Pro Asn Pro Lys Lys
195 200 205

Val Leu Ile Ile Gly Gly Gly Ile Gly Phe Thr Leu Phe Glu Met Leu
210 215 220

Arg Tyr Pro Thr Ile Glu Lys Ile Asp Ile Val Glu Ile Asp Asp Val
225 230 235 240

Val Val Asp Val Ser Arg Lys Ser Phe Pro Tyr Leu Ala Ala Asn Phe
245 250 255

Asn Asp Pro Arg Val Thr Leu Val Leu Gly Asp Gly Ala Ala Phe Val
260 265 270

Lys Ala Ala Gln Ala Gly Tyr Tyr Asp Ala Ile Ile Val Asp Ser Ser
275 280 285

Asp Pro Ile Gly Pro Ala Lys Asp Leu Phe Glu Arg Pro Phe Phe Glu
290 295 300

Ala Val Ala Lys Ala Leu Arg Pro Gly Gly Val Val Cys Thr Gln Ala
305 310 315 320

Glu Ser Ile Trp Leu His Met His Ile Ile Lys Gln Ile Ile Ala Asn
325 330 335

Cys Arg Gln Val Phe Lys Gly Ser Val Asn Tyr Ala Trp Thr Thr Val
340 345 350

Pro Thr Tyr Pro Thr Gly Val Ile Gly Tyr Met Leu Cys Ser Thr Glu
355 360 365

Gly Pro Glu Val Asp Phe Lys Asn Pro Ile Asn Pro Ile Asp Lys Glu
370 375 380

Thr Thr Gln Val Lys Ser Lys Leu Ala Pro Leu Lys Phe Tyr Asn Ser
385 390 395 400

Asp Ile His Lys Ala Ala Phe Ile Leu Pro Ser Phe Ala Arg Ser Met
405 410 415

Ile Glu Ser



<210> 13 
<211> 1636 
<212> DNA 
<213> Plant 

<400> 13 

ggcacgagat cagatccaat tctcttctgt gcttcccttc tctgctctca aattcttcag 60
atctacaaag ttttcttcat tttcagaggg cagacatgga aactttcttg ttcacctcag 120
agtcagtcaa tgaaggccac cccgacaagc tctgcgacca ggtctcggat gcaattcttg 180
atgcttgctt agaacaggat ccagaaagca aggttgcatg tgaaacctgc acaaagacaa 240
acatggttat ggtctttgga gagatcacaa ccaaggccac tgttgactat gagaagatag 300
tgcgtgacac atgcagaggc attgggttca cctcagcaga tgttggcctt gacgctgaca 360
actgcaaggt tcttgtcaac atcgagcagc agagccctga cattgcccaa ggtgttcacg 420
gtcatcttac caagaaacca gaagagattg gagctggtga ccaaggtcac atgtttggct 480
atgccactga tgaaacccca gagctcatgc cccttaccca tgtttgggcc actaagcttg 540
gtgccaagct taccgaagtg aggaagaaca agacttgccc atggctcaga ccagatggca 600
agacccaagt tactgttgag tacaagaacg acaatggtgc catggtccca attagagttc 660
acactgttct catctcaact caacatgacg aaactgtcac aaacgaccag attgcccagg 720
acttgaaaga gcatgtgatc aaacctgtga tcccatctca gtaccttgat gagaatacca 780
tcttccacct caacccatca ggtcgcttcg tcatcggtgg accacacgga gatgctggac 840
ttaccggcag gaaaattatc attgacacct acggaggctg gggtgcccat ggaggaggtg 900
ctttctcagg aaaggaccct actaaggtgg acaggagtgg tgcttatatt gttagacagg 960
cagcaaagag tgtggtcgcc tcaggacttg ctcgccgctg tattgtgcag gtttcttatg 1020
ctatcggtgt ggctgaacca ctttccgtgt ttgttgacac ttacaagact ggaacaattc 1080
cagacaagga tattttgact ctgatcaa^g agaactttga cttcaggcct ggaatgatgt 1140
caatcaacct tgacttgtta agaggaggca acttcaggta ccagaagact gcagcttatg 1200
gtcactttgg ccgtgatgac cccgacttct catgggagac tgtcaaggtc ctcaagccaa 1260
aagcttaagt gaggtgtagc cttttggcca ttatttttct tgcagaccaa taaacaagct 1320
tcatcatatc atgcattggt ggcaggagaa gagaatttgt gtctccattg gaggattcta 1380
tgagctctga gtcattgaac attgttattt ttctttcttt ttttttcacc cttttctgca 1440
gtaccttatt tttattttgt tactgttaag tagcagtgat ttaagttttc cctgttaagt 1500
agcagtgatt taagttttcc ctgttaagta gctggaatta agtttccatg ttctatcata 1560
ttatatgtga acttgtcaat tatctcctga ggtgaaagag tccttcaggg aatagtttaa 1620
aaaaaaaaaa aaaaaa 1636

<210> 14
<211> 390 
<212> PRT 
<213> Plant 

<400> 14 

Met Glu Thr Phe Leu Phe Thr Ser Glu Ser Val Asn Glu Gly His Pro
1 5 10 15

Asp Lys Leu Cys Asp Gln Val Ser Asp Ala Ile Leu Asp Ala Cys Leu
20 25 30

Glu Gln Asp Pro Glu Ser Lys Val Ala Cys Glu Thr Cys Thr Lys Thr
35 40 45

Asn Met Val Met Val Phe Gly Glu Ile Thr Thr Lys Ala Thr Val Asp
50 55 60

Tyr Glu Lys Ile Val Arg Asp Thr Cys Arg Gly Ile Gly Phe Thr Ser
65 70 75 80

Ala Asp Val Gly Leu Asp Ala Asp Asn Cys Lys Val Leu Val Asn Ile
85 90 95

Glu Gln Gln Ser Pro Asp Ile Ala Gln Gly Val His Gly His Leu Thr
100 105 110

Lys Lys Pro Glu Glu Ile Gly Ala Gly Asp Gln Gly His Met Phe Gly
115 120 125

Tyr Ala Thr Asp Glu Thr Pro Glu Leu Met Pro Leu Thr His Val Trp
130 135 140

Ala Thr Lys Leu Gly Ala Lys Leu Thr Glu Val Arg Lys Asn Lys Thr
145 150 155 160

Cys Pro Trp Leu Arg Pro Asp Gly Lys Thr Gln Val Thr Val Glu Tyr
165 170 175

Lys Asn Asp Asn Gly Ala Met Val Pro Ile Arg Val His Thr Val Leu
180 185 190

Ile Ser Thr Gln His Asp Glu Thr Val Thr Asn Asp Gln Ile Ala Gln
195 200 205

Asp Leu Lys Glu His Val Ile Lys Pro Val Ile Pro Ser Gln Tyr Leu
210 215 220

Asp Glu Asn Thr Ile Phe His Leu Asn Pro Ser Gly Arg Phe Val Ile
225 230 235 240

Gly Gly Pro His Gly Asp Ala Gly Leu Thr Gly Arg Lys Ile Ile Ile
245 250 255

Asp Thr Tyr Gly Gly Trp Gly Ala His Gly Gly Gly Ala Phe Ser Gly
260 265 270

Lys Asp Pro Thr Lys Val Asp Arg Ser Gly Ala Tyr Ile Val Arg Gln
275 280 285

Ala Ala Lys Ser Val Val Ala Ser Gly Leu Ala Arg Arg Cys Ile Val
290 295 300

Gln Val Ser Tyr Ala Ile Gly Val Ala Glu Pro Leu Ser Val Phe Val
305 310 315 320

Asp Thr Tyr Lys Thr Gly Thr Ile Pro Asp Lys Asp Ile Leu Thr Leu
325 330 335

Ile Lys Glu Asn Phe Asp Phe Arg Pro Gly Met Met Ser Ile Asn Leu
340 345 350

Asp Leu Leu Arg Gly Gly Asn Phe Arg Tyr Gln Lys Thr Ala Ala Tyr
355 360 365

Gly His Phe Gly Arg Asp Asp Pro Asp Phe Ser Trp Glu Thr Val Lys
370 375 380

Val Leu Lys Pro Lys Ala
385 390



<210> 15 
<211> 1596 
<212> DNA 
<213> Plant 



<400> 15 

ggcacgaggg gaacaagaga aacatcatat tattgaatcc ctagtttctt ttctttccct 60 
ttgattcctt cctctcattt acctctctct tttcttcctt tgtttggatg gccggccaaa 120 
caatcatcgt ttccgggttg aacccggcgg ccattcttca gtccacaatt ggcggcggag 180 
cttctcctac agcggcggcg gcggcggaaa acggcaccag aaaagtcatc cctctctcaa 240 
gagatgcctt acaagatttc atgttatcaa tcataaccca aaaattacaa gatgagaaac 300 
aaccttttta cgtgctagac ttgggtgagg ttgtttctct tatggaccaa tggaaatctg 360 
ctctcccaaa tatccgtcca ttttacgctg ttaaatgtaa ccctgaaccg tcgttccttt 420 
caattttatc tgctatgggc tcaaattttg attgtgctag ccgagctgaa attgagtatg 480 
ttttatctct tggcatttca cctgaccgta ttgttttcgc aaatccatgc aaaccggaat 540 
ccgatattat ttttgcagca aaagttgggg tgaatcttac aacctatgat tctgaagacg 600
aggtttacaa gatccgaaag catcacccga aatccgaact cttgctccgc atcaagccca 660 
tgctcgacgg caacgcgaga tgcccaatgg gcccgaaata cggcgcgctt ccagaagaag 720 
tcgacccgct gctccgggca gctcaagccg cccgtctcac cgtatccggc gtctcattcc 780 
acatcggtag cggagatgcc gattcaaacg cttatctcgg cgccatagcc gcggctaagg 840 
aagtgtttga aacagctgct aaactcggga tgtcgaaaat gactgttcta gacgtcggcg 900 
gcgggtttac atccggccac cagttcacaa ccgccgccgt cgccgttaaa tcagctttaa 960 
aacaacactt cgatgacgaa ccggagttga caatcatagc tgaaccgggt cggttttttg 1020 
cagagacggc gtttactttg gcaacgacga ttatagggaa aagagtgagg ggtgaattga 1080 
gggagtattg gattaacgac gggctgtacg gttcgatgaa ctgtgtactt tacgaccatg 1140 
cgacggtgaa tgcaacgccg ttagctgttc tgtcgaatcg tagtaacgtt acctgcggcg 1200 
ggtcgaaaac gtttccgacg actgtgtttg ggcccacttg tgatgctctt gatactgttt 1260 
taagggatta ccagttaccg gagctgcagg ttaatgattg gctggttttt cctaatatgg 1320 
gtgcttatac taaagctgct gggtccaatt ttaatggatt taatacttcc gccattgtta 1380 
ctcacctcgc ttattcttat ccaagctgat gaaccacctg tattaggaat tactaccgtg 1440
gttttgatgg ttttttcctt ttttgggtat ctttttttta attttgttgt ttttggtagt 1500 
aatttatatt ccaaatcagc ttgtaattct cttgtatgcc ataagaatgc aaggatttgc 1560 
taattgtgat tttctctaaa aaaaaaaaaa aaaaaa 1596

<210> 16
<211> 433
<212> PRT
<213> Plant 

<400> 16 

Met Ala Gly Gln Thr Ile Ile Val Ser Gly Leu Asn Pro Ala Ala Ile
1 5 10 15

Leu Gln Ser Thr Ile Gly Gly Gly Ala Ser Pro Thr Ala Ala Ala Ala
20 25 30

Ala Glu Asn Gly Thr Arg Lys Val Ile Pro Leu Ser Arg Asp Ala Leu
35 40 45

Gln Asp Phe Met Leu Ser Ile Ile Thr Gln Lys Leu Gln Asp Glu Lys
50 55 60

Gln Pro Phe Tyr Val Leu Asp Leu Gly Glu Val Val Ser Leu Met Asp
65 70 75 80

Gln Trp Lys Ser Ala Leu Pro Asn Ile Arg Pro Phe Tyr Ala Val Lys
85 90 95

Cys Asn Pro Glu Pro Ser Phe Leu Ser Ile Leu Ser Ala Met Gly Ser
100 105 110

Asn Phe Asp Cys Ala Ser Arg Ala Glu Ile Glu Tyr Val Leu Ser Leu
115 120 125

Gly Ile Ser Pro Asp Arg Ile Val Phe Ala Asn Pro Cys Lys Pro Glu
130 135 140

Ser Asp Ile Ile Phe Ala Ala Lys Val Gly Val Asn Leu Thr Thr Tyr
145 150 155 160

Asp Ser Glu Asp Glu Val Tyr Lys Ile Arg Lys His His Pro Lys Ser
165 170 175

Glu Leu Leu Leu Arg Ile Lys Pro Met Leu Asp Gly Asn Ala Arg Cys
180 185 190

Pro Met Gly Pro Lys Tyr Gly Ala Leu Pro Glu Glu Val Asp Pro Leu
195 200 205

Leu Arg Ala Ala Gln Ala Ala Arg Leu Thr Val Ser Gly Val Ser Phe
210 215 220

His Ile Gly Ser Gly Asp Ala Asp Ser Asn Ala Tyr Leu Gly Ala Ile
225 230 235 240

Ala Ala Ala Lys Glu Val Phe Glu Thr Ala Ala Lys Leu Gly Met Ser
245 250 255

Lys Met Thr Val Leu Asp Val Gly Gly Gly Phe Thr Ser Gly His Gln
260 265 270

Phe Thr Thr Ala Ala Val Ala Val Lys Ser Ala Leu Lys Gln His Phe
275 280 285

Asp Asp Glu Pro Glu Leu Thr Ile Ile Ala Glu Pro Gly Arg Phe Phe
290 295 300

Ala Glu Thr Ala Phe Thr Leu Ala Thr Thr Ile Ile Gly Lys Arg Val
305 310 315 320

Arg Gly Glu Leu Arg Glu Tyr Trp Ile Asn Asp Gly Leu Tyr Gly Ser
325 330 335

Met Asn Cys Val Leu Tyr Asp His Ala Thr Val Asn Ala Thr Pro Leu
340 345 350

Ala Val Leu Ser Asn Arg Ser Asn Val Thr Cys Gly Gly Ser Lys Thr
355 360 365

Phe Pro Thr Thr Val Phe Gly Pro Thr Cys Asp Ala Leu Asp Thr Val
370 375 380

Leu Arg Asp Tyr Gln Leu Pro Glu Leu Gln Val Asn Asp Trp Leu Val
385 390 395 400

Phe Pro Asn Met Gly Ala Tyr Thr Lys Ala Ala Gly Ser Asn Phe Asn
405 410 415

Gly Phe Asn Thr Ser Ala Ile Val Thr His Leu Ala Tyr Ser Tyr Pro
420 425 430

Ser



<210> 17 
<211> 2074 
<212> DNA 
<213> Plant 

<400> 17 

tggtaactgg accgacgcga catttgtcgt atatgtctta atcgggctag tcgctgacaa 60
catcatccac caagtcaaag ttcggaaatt catatcgttt ctcatcatct tctatccgag 120
aaatgagggg actatctgta tacggtcaaa accgagtctg cccttcatat gactaatcga 180 
gattagaaca taatggtcta aggttcatca ttataataac gagccatgat atagagttag 240 
gttgtcaagc tcaagcccca gagagcgatc aatatcgaga tcgagccaag gttaaactcg 300 
agacctagag accgatcaat accgagaccg accaagtcaa ggtcgagctc gagacccaga 360 
gaccggtcaa gattgagatc ggccaagatc gagatcgagc caagaaatta aaaagtcgtt 420 
atagccgcat ttagggagag aatctctgcg gaaatcacga cttgaatcag ggaaaaacta 480 
attaattaat ctatcatgtg atccccacta tgtattttta attatactca aaatgggatt 540 
cccccactat attaagagtg gttatcattt gtaatggaga gacatacaca cattcattct 600 
gacatataca gaaaatagag caaatactat ccttttttgg cttttgatat ttagtcatat 660 
tgtttcttct acccattgtt cttcactcaa tttggaggtg ataaaacttg aaggtttaag 720 
ctaactagtc cattcgggtt gcattcattt cttttacaat aatttcgtca tcatttattt 780 
attttctcaa ttgtactaag ttataccacg tatttttaga actgcgtata aattcaactc 840 
tatccatttt tcgggtaaac accgaataca tagcacaata gcaccctcaa ttgcaaaagt 900
ccaaagccaa gggttcattc ctttctgaag aaatgagata gagaattgaa aatctaattt 960 
agttatctaa atctttataa tttagccttc catataagaa aaaggaaaca aattaactga 1020 
agaacaatag cctcgcatag atttaccttc tccatataaa ttttgtttat actcaatttt 1080 
tttgcaaatg tgtctaaaat gataggactt gcaaattttt atttaacatt tcctactcct 1140 
ctttaatttt caagaaatta attttaagca ttctcgattt gctctgcccg ctccgtcccg 1200 
ttgccatctc tgactcggat aggacctcga ttgcaaaaat ccaaacccaa ggaaccttcc 1260 
atacattaca taagccacaa aatagtaact attaaaaact accaatatat cctcaaatac 1320 
tcgcgattat ttcataccta acacgtttac cttatcttct cgtaatgacg ctacattagt 1380 
tagtgatata aaataccgaa tttaccacgc ggcaaccctc cgctgtctat ccacggcccg 1440 
agagaatctc ttagcccccc aaatacgaaa attaacttct agaattttat tttctggtta 1500 
ttaccatgaa aataaagaaa aagagaaaag tcaagaaatt taattgggct aatactgggg 1560 
tccactgccc agccacgcat ttccctccta tataaagcgt cgtcacctct catgcaaatc 1620 
tcgctcactt cacagttgtt agtttcacgt tctcttctca attcccataa aagaaaccct 1680 
tccgttaggt ttccgtccta ttttctcttc ttctacgctt cctcttctga tatcaatatc 1740 
tgtatggtgt ttttcttgtt cgaattttag atttgttttg cctttaatac ctgtaacctt 1800 
ataattctct gtttaaacca aaaacttagc ttcttctgaa gtcagggtgg ggatatttgg 1860 
atcgtgtaag agtgtgttag aaggtgatta tcttttgatt cagttccttt tttgcttctt 1920 
ttgagggggt agccggggcc tcggcctcgg cgggttttaa tagcccccat ctattacaac 1980 
cattgggcaa aaacatcatt aaatctgtac aaagcaaacc cttaatttag tttaattttc 2040
tgtattcttt gattctttaa cagaagaaga agag 2074

<210> 18 
<211> 4321 
<212> DNA 
<213> Plant 

<400> 18 

tggtaactgg accgacgcga catttgtcgt atatgtctta atcgggctag tcgctgacaa 60
catcatccac caagtcaaag ttcggaaatt catatcgttt ctcatcatct tctatccgag 120
aaatgagggg actatctgta tacggtcaaa accgagtctg cccttcatat gactaatcga 180
gattagaaca taatggtcta aggttcatca ttataataac gagccatgat atagagttag 240
gttgtcaagc tcaagcccca gagagcgatc aatatcgaga tcgagccaag gttaaactcg 300
agacctagag accgatcaat accgagaccg accaagtcaa ggtcgagctc gagacccaga 360
gaccggtcaa gattgagatc ggccaagatc gagatcgagc caagaaatta aaaagtcgtt 420
atagccgcat ttagggagag aatctctgcg gaaatcacga cttgaatcag ggaaaaacta 480
attaattaat ctatcatgtg atccccacta tgtattttta attatactca aaatgggatt 540
cccccactat attaagagtg gttatcattt gtaatggaga gacatacaca cattcattct 600
gacatataca gaaaatagag caaatactat ccttttttgg cttttgatat ttagtcatat 660
tgtttcttct acccattgtt cttcactcaa tttggaggtg ataaaacttg aaggtttaag 720
ctaactagtc cattcgggtt gcattcattt cttttacaat aatttcgtca tcatttattt 780
attttctcaa ttgtactaag ttataccacg tatttttaga actgcgtata aattcaactc 840
tatccatttt tcgggtaaac accgaataca tagcacaata gcaccctcaa ttgcaaaagt 900
ccaaagccaa gggttcattc ctttctgaag aaatgagata gagaattgaa aatctaattt 960
agttatctaa atctttataa tttagccttc catataagaa aaaggaaaca aattaactga 1020
agaacaatag cctcgcatag atttaccttc tccatataaa ttttgtttat actcaatttt 1080
tttgcaaatg tgtctaaaat gataggactt gcaaattttt atttaacatt tcctactcct 1140
ctttaatttt caagaaatta attttaagca ttctcgattt gctctgcccg ctccgtcccg 1200
ttgccatctc tgactcggat aggacctcga ttgcaaaaat ccaaacccaa ggaaccttcc 1260
atacattaca taagccacaa aatagtaact attaaaaact accaatatat cctcaaatac 1320
tcgcgattat ttcataccta acacgtttac cttatcttct cgtaatgacg ctacattagt 1380
tagtgatata aaataccgaa tttaccacgc ggcaaccctc cgctgtctat ccacggcccg 1440
agagaatctc ttagcccccc aaatacgaaa attaacttct agaattttat tttctggtta 1500
ttaccatgaa aataaagaaa aagagaaaag tcaagaaatt taattgggct aatactgggg 1560
tccactgccc agccacgcat ttccctccta tataaagcgt cgtcacctct catgcaaatc 1620
tcgctcactt cacagttgtt agtttcacgt tctcttctca attcccataa aagaaaccct 1680
tccgttaggt ttccgtccta ttttctcttc ttctacgctt cctcttctga tatcaatatc 1740
tgtatggtgt ttttcttgtt cgaattttag atttgttttg cctttaatac ctgtaacctt 1800
ataattctct gtttaaacca aaaacttagc ttcttctgaa gtcagggtgg ggatatttgg 1860
atcgtgtaag agtgtgttag aaggtgatta tcttttgatt cagttccttt tttgcttctt 1920
ttgagggggt agccggggcc tcggcctcgg cgggttttaa tagcccccat ctattacaac 1980
cattgggcaa aaacatcatt aaatctgtac aaagcaaacc cttaatttag tttaattttc 2040
tgtattcttt gattctttaa cagaagaaga agagatgccg gccctaggtt gttgcgtaga 2100
cgctactgtt tcccctcctc tcggctatgc cttctctcgg gatagctctc ttcccgcgcc 2160
ggagttcttt acctccggcg tacctcctac aaactccgcc gccggttcca ttgggtctcc 2220
ggatctgtcc tctgctttgt acggggtcga tgggtgggga gctccttatt tctccgttaa 2280
ctctaacgga gatatctccg tccgaccaca tggtacggac acactccccc accaggaaat 2340
tgaccttctc aaggtcgtga aaaaggcctc cgacccgaaa aattcagggg ggctcgggct 2400
tcagctgcct cttgttgttc gcttccctga tgtgctaaaa aaccggttgg aatctctgca 2460
atcggctttt gatctcgctg ttcattccca gggctatggg gcccactacc aaggtgttta 2520
tcccgtgaaa tgcaatcaag acaggttcgt ggtggaagat attgtcaaat tcgggtcgtc 2580
attccggttc gggttggaag ctgggtctaa acccgagctc ctgttagcca tgagctgtct 2640
ctgcaggggc agtgctgagg gccttctcgt ttgcaatggt ttcaaggacg ctgagtacat 2700
ttcgcttgct ttggttgcaa gaaagctcat gttaaacact gtaattgttc ttgaacaaga 2760
ggaggagctt gaccttgtga ttgatataag ccgtaagatg gctgttcggc ccgtaattgg 2820
acttcgggct aagctcagga ccaagcattc aggccatttt ggatccactt ctggagaaaa 2880
aggtaagttt gggcttacaa cgacccaaat tgttcgtgta gtgaagaagc tggaagaatc 2940
cggaatgctg gattgccttc agttgctgca ttttcacatt ggatctcaga tcccttcaac 3000
ggcgttgctt gctgatggtg ttggtgaggc tgctcagatt tattgtgaat taatccgtct 3060
tggtgcgggt atgaagttca ttgatactgg aggtgggctc ggaattgatt atgatggtac 3120
taaatcatgt gattcagatg tctctgttgg ctatggcatt caagaatacg cctccacagt 3180
tgtccaggcg gttcaatatg tttgcgaccg taagggcgtg aagcacccag tgatttgcag 3240
cgaaagtggc agggcaattg tttctcatca ctcaattctg attttcgaag ccgtgtctgc 3300
ttctagtcac tcatgttctt cttcacatct gtcttctggt ggcctccaat ccatggcgga 3360
gacgctcaat gaagatgccc ttgctgatta ccgcaattta tctgctgctg cagttcgtgg 3420
agagtacgag acgtgtgtac tttactctga tcagttgaaa cagagatgtg tggatcagtt 3480
taaagaaggg tccttgggta ttgaacatct tgctgctgtt gatagcatct gtgattttgt 3540
atcaaaggct atgggggctg ctgatcctat ccgcacttac catgtgaatc tgtcaatttt 3600
cacttcaatt cctgattttt gggcctttgg tcaattgttt ccgattgttc caatacaccg 3660
tttagatgaa aagcctgcag taaggggaat attatcggac ttgacttgtg acagtgatgg 3720
gaaggttgat aagttcattg gtggcgaatc aagcttgcag ctgcatgaat tgggaagtaa 3780
tggcgatggt ggtgggtatt atctggggat gtttttgggt ggggcttatg aggaggcgct 3840
cggaggactc cacaacctgt ttggtggacc aagcgtggtg cgcgtggtgc agagcgatag 3900
cgctcacagc ttcgccatgt ctcgctccgt ccctggcccg tcctgcgcgg acgtgctccg 3960
agcgatgcag cacgagcccg agctcatgtt cgagactctc aagcaccgtg cggaggaatt 4020
cttggaacaa gaagaagaca aagggctggc cattgcatct ttggccagca gcttagctca 4080
gtccttccat aacatgcctt accttgtggc gcctgcatct tgctgcttca ctgcagttac 4140
tgctaacaac ggtggctata actactatta cagtgatgag aatgcagcag attctgctac 4200
aggggaggat gagatttggt cctattgcac tgcttgaagt gttgtcgtag catctccagt 4260
tttagtttgt cgtcgaagtt gtctgttttt gaataatacc cttagttggt gatgtttttc 4320
t 4321



<210> 19 
<211> 720 
<212> PRT 
<213> Plant 

<400> 19 

Met Pro Ala Leu Gly Cys Cys Val Asp Ala Thr Val Ser Pro Pro Leu
1 5 10 15

Gly Tyr Ala Phe Ser Arg Asp Ser Ser Leu Pro Ala Pro Glu Phe Phe
20 25 30

Thr Ser Gly Val Pro Pro Thr Asn Ser Ala Ala Gly Ser Ile Gly Ser
35 40 45

Pro Asp Leu Ser Ser Ala Leu Tyr Gly Val Asp Gly Trp Gly Ala Pro
50 55 60

Tyr Phe Ser Val Asn Ser Asn Gly Asp Ile Ser Val Arg Pro His Gly
65 70 75 80

Thr Asp Thr Leu Pro His Gln Glu Ile Asp Leu Leu Lys Val Val Lys
85 90 95

Lys Ala Ser Asp Pro Lys Asn Ser Gly Gly Leu Gly Leu Gln Leu Pro
100 105 110

Leu Val Val Arg Phe Pro Asp Val Leu Lys Asn Arg Leu Glu Ser Leu
115 120 125

Gln Ser Ala Phe Asp Leu Ala Val His Ser Gln Gly Tyr Gly Ala His
130 135 140

Tyr Gln Gly Val Tyr Pro Val Lys Cys Asn Gln Asp Arg Phe Val Val
145 150 155 160

Glu Asp Ile Val Lys Phe Gly Ser Ser Phe Arg Phe Gly Leu Glu Ala
165 170 175

Gly Ser Lys Pro Glu Leu Leu Leu Ala Met Ser Cys Leu Cys Arg Gly
180 185 190

Ser Ala Glu Gly Leu Leu Val Cys Asn Gly Phe Lys Asp Ala Glu Tyr
195 200 205

Ile Ser Leu Ala Leu Val Ala Arg Lys Leu Met Leu Asn Thr Val Ile
210 215 220

Val Leu Glu Gln Glu Glu Glu Leu Asp Leu Val Ile Asp Ile Ser Arg
225 230 235 240

Lys Met Ala Val Arg Pro Val Ile Gly Leu Arg Ala Lys Leu Arg Thr
245 250 255

Lys His Ser Gly His Phe Gly Ser Thr Ser Gly Glu Lys Gly Lys Phe
260 265 270

Gly Leu Thr Thr Thr Gln Ile Val Arg Val Val Lys Lys Leu Glu Glu
275 280 285

Ser Gly Met Leu Asp Cys Leu Gln Leu Leu His Phe His Ile Gly Ser
290 295 300

Gln Ile Pro Ser Thr Ala Leu Leu Ala Asp Gly Val Gly Glu Ala Ala
305 310 315 320

Gln Ile Tyr Cys Glu Leu Ile Arg Leu Gly Ala Gly Met Lys Phe Ile
325 330 335

Asp Thr Gly Gly Gly Leu Gly Ile Asp Tyr Asp Gly Thr Lys Ser Cys
340 345 350

Asp Ser Asp Val Ser Val Gly Tyr Gly Ile Gln Glu Tyr Ala Ser Thr
355 360 365

Val Val Gln Ala Val Gln Tyr Val Cys Asp Arg Lys Gly Val Lys His
370 375 380

Pro Val Ile Cys Ser Glu Ser Gly Arg Ala Ile Val Ser His His Ser
385 390 395 400

Ile Leu Ile Phe Glu Ala Val Ser Ala Ser Ser His Ser Cys Ser Ser
405 410 415

Ser His Leu Ser Ser Gly Gly Leu Gln Ser Met Ala Glu Thr Leu Asn
420 425 430

Glu Asp Ala Leu Ala Asp Tyr Arg Asn Leu Ser Ala Ala Ala Val Arg
435 440 445

Gly Glu Tyr Glu Thr Cys Val Leu Tyr Ser Asp Gln Leu Lys Gln Arg
450 455 460

Cys Val Asp Gln Phe Lys Glu Gly Ser Leu Gly Ile Glu His Leu Ala
465 470 475 480

Ala Val Asp Ser Ile Cys Asp Phe Val Ser Lys Ala Met Gly Ala Ala
485 490 495

Asp Pro Ile Arg Thr Tyr His Val Asn Leu Ser Ile Phe Thr Ser Ile
500 505 510

Pro Asp Phe Trp Ala Phe Gly Gln Leu Phe Pro Ile Val Pro Ile His
515 520 525

Arg Leu Asp Glu Lys Pro Ala Val Arg Gly Ile Leu Ser Asp Leu Thr
530 535 540

Cys Asp Ser Asp Gly Lys Val Asp Lys Phe Ile Gly Gly Glu Ser Ser
545 550 555 560

Leu Gln Leu His Glu Leu Gly Ser Asn Gly Asp Gly Gly Gly Tyr Tyr
565 570 575

Leu Gly Met Phe Leu Gly Gly Ala Tyr Glu Glu Ala Leu Gly Gly Leu
580 585 590

His Asn Leu Phe Gly Gly Pro Ser Val Val Arg Val Val Gln Ser Asp
595 600 605

Ser Ala His Ser Phe Ala Met Ser Arg Ser Val Pro Gly Pro Ser Cys
610 615 620

Ala Asp Val Leu Arg Ala Met Gln His Glu Pro Glu Leu Met Phe Glu
625 630 635 640

Thr Leu Lys His Arg Ala Glu Glu Phe Leu Glu Gln Glu Glu Asp Lys
645 650 655

Gly Leu Ala Ile Ala Ser Leu Ala Ser Ser Leu Ala Gln Ser Phe His
660 665 670

Asn Met Pro Tyr Leu Val Ala Pro Ala Ser Cys Cys Phe Thr Ala Val
675 680 685

Thr Ala Asn Asn Gly Gly Tyr Asn Tyr Tyr Tyr Ser Asp Glu Asn Ala
690 695 700

Ala Asp Ser Ala Thr Gly Glu Asp Glu Ile Trp Ser Tyr Cys Thr Ala
705 710 715 720



<210> 20 
<211> 2118 
<212> DNA 
<213> Plant 



<400> 20 

gaattcctta tccggatttc tggtacgcag actgtaatat ggagtcatct tctcctcgat 60 
tcgggattaa aattaggtga cttgggacac cctaaatctc ccaagtggcg actctgaaat 120 
aaataaacaa atcccgtttc gattgtcctt aaattggaaa aaactccctt gtaccctccc 180 
gggtacggaa aaaggaggtg tacagcaatg acccaaaact tttattgcta tacattttga 240 
ggaatcaact tgatcaaaat ttatgggtga aattcaatgt ggtatgattt atattaggtc 300 
ggactttagc agatgtggtc acttcaattt gcggcaaaaa taatgtacag ggataataat 360 
aaaaagtact agaaatttga gtcataaagc tttttcaatt ttacaaaaga tattaagata 420 
cttattaaat caaatgtact ttattaatgt aatagcatga aaaaacagcc tcatccgcct 480 
gtcctcaccc cacaaaaagg agatagagaa aggaaactaa tcttatttaa ttttccacat 540 
ataaaattta tatccttgta taaatcccca aaaaaaaaaa atcaatacta attattttaa 600 
tttaatcatc cgtataagaa agaagctaat taactgactt acaaactgaa tagatagcac 660 
aatagcactc tcaattacaa aaatccaaag ccgagggtca ttcctttcat caagaaatta 720 
gatagggaat ggaaaatata atttaattat ctgaatcttt ataatttatc cttccatata 780 
agaaaaagga aacaaattaa ctgaagagca tatagcctcg catagattta ccttctccat 840 
atgggtgggg aaaccgacaa accgcaccaa ttcgataatt cgagtcaaac tgaggaaaaa 900 
aaaattcgac tatggtttgg tttgatttgg tttattgttg ggataaaaaa tcgatcataa 960 
ttggtttggt ttggttttaa ctaaagaaag tcaaaccgaa accaaacaaa cccgacatta 1020 
catatataaa ttttttagat atatttaata tataaatata cttgttgtga tgtaatttat 1080 
aaatatttct taaaaatatt cataatttta tcttttaaga tattatttcg tacttagaac 1140 
ttttgaatgt ttcttactcc tctttaattt gttagaaatt aatttgaagg agtttgaatt 1200 
tgctccgccc ccgctccgtc ccgttgccat ccctgactca ataggataac agcaatctcg 1260 
attgcaaaaa tccaaaccca aggaaccttc ccaacattac ataagctaca aagtagagta 1320
gtttattaaa taactaccaa tatatcctca aattctcgcg attatttcat acctaacacg 1380
cttaccttat cttctcgtaa tgacgctaca ttagttggtg atataaaata ccgaatttgc 1440 
cacgcggcaa tcctccgctg tctatccacg gcccgagaga atctcttagc cccccaaaga 1500 
tgaaaattaa cttctagaat tttattttct ggttattacc atgaaaataa ttaaataaaa 1560 
aaaaagagaa aagtaaagat atttaattgg gctaaaactg gggtccacgg cccagccacg 1620 
catttccctc ctatataaag cgtcgtcacc tctcatgcaa atctcgctca ctacacagtt 1680 
gttagtttca cgttctcttc tcaattccca taacagaaac ccttccgtta ggtttccgtc 1740 
ctatttttcc tcatcttctc cgtttcctct tctgaaatca atatctgtat ggtgtttttc 1800 
ttgttcgaat tttagatttg ttttgtcttt aatacctata accttaaatt ctctgtttaa 1860 
accaaaaact tagcttcttc tgaagtcagg gtggggattt ttggatcgtg taagagtgtg 1920 
ttagagggtg attatctttt gattcagttc cttttttgct tcttttgagg gggtagccgg 1980 
ggcctcggcc tcggcgggtt ttaatagccc ccatctatta caactattgg gcaaaaacat 2040 
cattaaatct gtacaaaaca aacccttaat ttagtttaat tttctgtatt cattgatttt 2100 
ttaacagaag aagaagag 2118



<210> 21 
<211> 4368 
<212> DNA
<213> Plant



<400> 21 

gaattcctta tccggatttc tggtacgcag actgtaatat ggagtcatct tctcctcgat 60
tcgggattaa aattaggtga cttgggacac cctaaatctc ccaagtggcg actctgaaat 120
aaataaacaa atcccgtttc gattgtcctt aaattggaaa aaactccctt gtaccctccc 180
gggtacggaa aaaggaggtg tacagcaatg acccaaaact tttattgcta tacattttga 240
ggaatcaact tgatcaaaat ttatgggtga aattcaatgt ggtatgattt atattaggtc 300
ggactttagc agatgtggtc acttcaattt gcggcaaaaa taatgtacag ggataataat 360
aaaaagtact agaaatttga gtcataaagc tttttcaatt ttacaaaaga tattaagata 420
cttattaaat caaatgtact ttattaatgt aatagcatga aaaaacagcc tcatccgcct 480
gtcctcaccc cacaaaaagg agatagagaa aggaaactaa tcttatttaa ttttccacat 540
ataaaattta tatccttgta taaatcccca aaaaaaaaaa atcaatacta attattttaa 600
tttaatcatc cgtataagaa agaagctaat taactgactt acaaactgaa tagatagcac 660
aatagcactc tcaattacaa aaatccaaag ccgagggtca ttcctttcat caagaaatta 720
gatagggaat ggaaaatata atttaattat ctgaatcttt ataatttatc cttccatata 780
agaaaaagga aacaaattaa ctgaagagca tatagcctcg catagattta ccttctccat 840
atgggtgggg aaaccgacaa accgcaccaa ttcgataatt cgagtcaaac tgaggaaaaa 900
aaaattcgac tatggtttgg tttgatttgg tttattgttg ggataaaaaa tcgatcataa 960
ttggtttggt ttggttttaa ctaaagaaag tcaaaccgaa accaaacaaa cccgacatta 1020
catatataaa ttttttagat atatttaata tataaatata cttgttgtga tgtaatttat 1080
aaatatttct taaaaatatt cataatttta tcttttaaga tattatttcg tacttagaac 1140
ttttgaatgt ttcttactcc tctttaattt gttagaaatt aatttgaagg agtttgaatt 1200
tgctccgccc ccgctccgtc ccgttgccat ccctgactca ataggataac agcaatctcg 1260
attgcaaaaa tccaaaccca aggaaccttc ccaacattac ataagctaca aagtagagta 1320
gtttattaaa taactaccaa tatatcctca aattctcgcg attatttcat acctaacacg 1380
cttaccttat cttctcgtaa tgacgctaca ttagttggtg atataaaata ccgaatttgc 1440
cacgcggcaa tcctccgctg tctatccacg gcccgagaga atctcttagc cccccaaaga 1500
tgaaaattaa cttctagaat tttattttct ggttattacc atgaaaataa ttaaataaaa 1560
aaaaagagaa aagtaaagat atttaattgg gctaaaactg gggtccacgg cccagccacg 1620
catttccctc ctatataaag cgtcgtcacc tctcatgcaa atctcgctca ctacacagtt 1680
gttagtttca cgttctcttc tcaattccca taacagaaac ccttccgtta ggtttccgtc 1740
ctatttttcc tcatcttctc cgtttcctct tctgaaatca atatctgtat ggtgtttttc 1800
ttgttcgaat tttagatttg ttttgtcttt aatacctata accttaaatt ctctgtttaa 1860
accaaaaact tagcttcttc tgaagtcagg gtggggattt ttggatcgtg taagagtgtg 1920
ttagagggtg attatctttt gattcagttc cttttttgct tcttttgagg gggtagccgg 1980
ggcctcggcc tcggcgggtt ttaatagccc ccatctatta caactattgg gcaaaaacat 2040
cattaaatct gtacaaaaca aacccttaat ttagtttaat tttctgtatt cattgatttt 2100
ttaacagaag aagaagagat gccggcccta ggttgttgtg tagatgctgc tgttgtttcc 2160
cctcctctca gctatgcctt ctctcgggat agctctcttc ccgcgccgga gttctttgcc 2220
tccggcgtac ctccgacaaa ttctgccgct gcttccattg ggtctccgga tttgtcgtct 2280
gctttatacg gggtcgatgg gtggggagct ccttatttct ctgttaactc taatggagat 2340
atctccgtcc gaccacacgg tacggacact ctccctcacc aggaaattga ccttctcaag 2400
gtcgtgaaaa aggcctccga cccgaaaaat tcaggtgggc ttgggcttca gctgcctctt 2460
gttgttcgct tccctgatgt gttgaaaaac cggttggaat ctctgcaatc ggcttttgat 2520
ctcgcggttc attcccaggg ctatggggcc cactaccaag gtgtttatcc cgtgaaatgc 2580
aatcaagaca ggttcgtggt ggaagatatc gtgaaattcg ggtcgccatt ccggttcggg 2640
ttggaagccg ggtctaaacc cgagctcctg ttagccatga gctgtctctg caagggcagt 2700
gctgagggcc ttctcgtttg caatggtttc aaggacgctg agtacatttc gcttgctttg 2760
gttgcaagaa agctcatgtt aaacactgta attgtgcttg aacaagagga ggagcttgac 2820
cttgtgattg atataagcca taagatggct gttcggcctg taattggact tcgggctaag 2880
ctcaggacca agcattcagg ccattttgga tccacttctg gagaaaaagg taagtttggg 2940
cttacaacga cccaaattgt tcgtgtggtg aagaagctag aagaatccgg aatgctggat 3000
tgtcttcagt tgctgcattt tcacattgga tctcagatcc cttctacggg gttgctagct 3060
gatggagttg gtgaggccgc tcagatttat tgtgaattag tccgtcttgg agcgggtatg 3120
aagttcattg atattggagg tgggcttgga attgattatg atggtactaa atcatgcgat 3180
tctgatgtct ctgttggcta tggcattcaa gaatatgcct ccgcagttgt tcaggcggtt 3240
caatatgtat gcgaccgtaa gggtgtgaaa cacccagtga tctgcagcga aagtggcagg 3300
gcaattgttt ctcatcactc aattctgatt ttcgaagccg tgtctgcttc tagtcactca 3360
tgttcttctt cacatctgtc ttctggtggc ctccaatcca tggcggagac gctcaacgaa 3420
gatgcccttg ctgattaccg caatttatct gctgctgcag ttcgtggaga gtatgagaca 3480
tgtgtacttt actctgatca gttgaaacag agatgtgtgg atcagtttaa agaagggtcc 3540
ttgggtattg aacatcttgc tgctgttgat agcatctgtg attttgtatc aaaggctatg 3600
ggggctgctg atcctgtccg cacttaccat gtgaatctgt caattttcac ttcaattcct 3660
gatttttggg cctttggtca attgtttccg attgttccaa ttcaccgctt agatgaaaag 3720
cctgcagtga ggggaatatt atcggactta acttgtgaca gtgatgggaa ggttgataag 3780
ttcattggtg gcgaatcaag cttgccgcta catgaattgg gaagtaatgg cgatggtggt 3840
ggttattatc tggggatgtt tttgggtggg gcttatgagg aggcgctcgg aggactccac 3900
aacctgtttg gtggaccaag tgtcgtgcgc gtggtgcaga gcgatagcgc tcacagcttt 3960
gccatgactc gctccgtccc tggcccgtct tgcgctgatg tgctccgagc gatgcagcac 4020
gagcccgagc tcatgttcga gactctcaag caccgtgcgg aggaattctt ggaacaagaa 4080
gatgacaaag ggctggctgt tgaatctttg gccagcagcg tagctcagtc cttccataac 4140
atgccttacc ttgtggcgcc ttcatcttgc cgcttcactg ctgctactga taacaatggt 4200
ggctataatt actattacag tgatgagaat gcagcagatt ctgctacagg ggaggatgag 4260
atttggtcct attgcactgc ttgaagtgtt ctcgtagcat ctccagtctt agtttgtcgt 4320
cgaggttgtc tgtttttgaa taataccctt agttggtgat gtttttct 4368



<210> 22
<211> 721
<212> PRT 
<213> Plant 

<400> 22 

Met Pro Ala Leu Gly Cys Cys Val Asp Ala Ala Val Val Ser Pro Pro
1 5 10 15

Leu Ser Tyr Ala Phe Ser Arg Asp Ser Ser Leu Pro Ala Pro Glu Phe
20 25 30

Phe Ala Ser Gly Val Pro Pro Thr Asn Ser Ala Ala Ala Ser Ile Gly
35 40 45

Ser Pro Asp Leu Ser Ser Ala Leu Tyr Gly Val Asp Gly Trp Gly Ala
50 55 60

Pro Tyr Phe Ser Val Asn Ser Asn Gly Asp Ile Ser Val Arg Pro His
65 70 75 80

Gly Thr Asp Thr Leu Pro His Gln Glu Ile Asp Leu Leu Lys Val Val
85 90 95

Lys Lys Ala Ser Asp Pro Lys Asn Ser Gly Gly Leu Gly Leu Gln Leu
100 105 110

Pro Leu Val Val Arg Phe Pro Asp Val Leu Lys Asn Arg Leu Glu Ser
115 120 125

Leu Gln Ser Ala Phe Asp Leu Ala Val His Ser Gln Gly Tyr Gly Ala
130 135 140

His Tyr Gln Gly Val Tyr Pro Val Lys Cys Asn Gln Asp Arg Phe Val
145 150 155 160

Val Glu Asp Ile Val Lys Phe Gly Ser Pro Phe Arg Phe Gly Leu Glu
165 170 175

Ala Gly Ser Lys Pro Glu Leu Leu Leu Ala Met Ser Cys Leu Cys Lys
180 185 190

Gly Ser Ala Glu Gly Leu Leu Val Cys Asn Gly Phe Lys Asp Ala Glu
195 200 205

Tyr Ile Ser Leu Ala Leu Val Ala Arg Lys Leu Met Leu Asn Thr Val
210 215 220

Ile Val Leu Glu Gln Glu Glu Glu Leu Asp Leu Val Ile Asp Ile Ser
225 230 235 240

His Lys Met Ala Val Arg Pro Val Ile Gly Leu Arg Ala Lys Leu Arg
245 250 255

Thr Lys His Ser Gly His Phe Gly Ser Thr Ser Gly Glu Lys Gly Lys
260 265 270

Phe Gly Leu Thr Thr Thr Gln Ile Val Arg Val Val Lys Lys Leu Glu
275 280 285

Glu Ser Gly Met Leu Asp Cys Leu Gln Leu Leu His Phe His Ile Gly
290 295 300

Ser Gln Ile Pro Ser Thr Gly Leu Leu Ala Asp Gly Val Gly Glu Ala
305 310 315 320

Ala Gln Ile Tyr Cys Glu Leu Val Arg Leu Gly Ala Gly Met Lys Phe
325 330 335

Ile Asp Ile Gly Gly Gly Leu Gly Ile Asp Tyr Asp Gly Thr Lys Ser
340 345 350

Cys Asp Ser Asp Val Ser Val Gly Tyr Gly Ile Gln Glu Tyr Ala Ser
355 360 365

Ala Val Val Gln Ala Val Gln Tyr Val Cys Asp Arg Lys Gly Val Lys
370 375 380

His Pro Val Ile Cys Ser Glu Ser Gly Arg Ala Ile Val Ser His His
385 390 395 400

Ser Ile Leu Ile Phe Glu Ala Val Ser Ala Ser Ser His Ser Cys Ser
405 410 415

Ser Ser His Leu Ser Ser Gly Gly Leu Gln Ser Met Ala Glu Thr Leu
420 425 430

Asn Glu Asp Ala Leu Ala Asp Tyr Arg Asn Leu Ser Ala Ala Ala Val
435 440 445

Arg Gly Glu Tyr Glu Thr Cys Val Leu Tyr Ser Asp Gln Leu Lys Gln
450 455 460

Arg Cys Val Asp Gln Phe Lys Glu Gly Ser Leu Gly Ile Glu His Leu
465 470 475 480

Ala Ala Val Asp Ser Ile Cys Asp Phe Val Ser Lys Ala Met Gly Ala
485 490 495

Ala Asp Pro Val Arg Thr Tyr His Val Asn Leu Ser Ile Phe Thr Ser
500 505 510

Ile Pro Asp Phe Trp Ala Phe Gly Gln Leu Phe Pro Ile Val Pro Ile
515 520 525

His Arg Leu Asp Glu Lys Pro Ala Val Arg Gly Ile Leu Ser Asp Leu
530 535 540

Thr Cys Asp Ser Asp Gly Lys Val Asp Lys Phe Ile Gly Gly Glu Ser
545 550 555 560

Ser Leu Pro Leu His Glu Leu Gly Ser Asn Gly Asp Gly Gly Gly Tyr
565 570 575

Tyr Leu Gly Met Phe Leu Gly Gly Ala Tyr Glu Glu Ala Leu Gly Gly
580 585 590

Leu His Asn Leu Phe Gly Gly Pro Ser Val Val Arg Val Val Gln Ser
595 600 605

Asp Ser Ala His Ser Phe Ala Met Thr Arg Ser Val Pro Gly Pro Ser
610 615 620

Cys Ala Asp Val Leu Arg Ala Met Gln His Glu Pro Glu Leu Met Phe
625 630 635 640

Glu Thr Leu Lys His Arg Ala Glu Glu Phe Leu Glu Gln Glu Asp Asp
645 650 655

Lys Gly Leu Ala Val Glu Ser Leu Ala Ser Ser Val Ala Gln Ser Phe
660 665 670

His Asn Met Pro Tyr Leu Val Ala Pro Ser Ser Cys Arg Phe Thr Ala
675 680 685

Ala Thr Asp Asn Asn Gly Gly Tyr Asn Tyr Tyr Tyr Ser Asp Glu Asn
690 695 700

Ala Ala Asp Ser Ala Thr Gly Glu Asp Glu Ile Trp Ser Tyr Cys Thr
705 710 715 720

Ala

<210> 23
<211> 2695
<212> DNA
<213> Plant

<400> 23 

ttcacgttct cttctcaatt cccataaaag aaacccttcc gttaggtttc cgtcctattt 60 

tctcttcttc tacgcttcct cttctgatat caatatctgt atggtgtttt tcttgttcga 120 

attttagatt tgttttgcct ttaatacctg taaccttata attctctgtt taaaccaaaa 180

acttagcttc ttctgaagtc agggtgggga tatttggatc gtgtaagagt gtgttagaag 240 

gtgattatct tttgattcag ttcctttttt gcttcttttg agggggtagc cggggcctcg 300 

gcctcggcgg gttttaatag cccccatcta ttacaaccat tgggcaaaaa catcattaaa 360

tctgtacaaa gcaaaccctt aatttagttt aattttctgt attctttgat tctttaacag 420

aagaagaaga gatgccggcc ctaggttgtt gcgtagacgc tactgtttcc cctcctctcg 480 

gctatgcctt ctctcgggat agctctcttc ccgcgccgga gttctttacc tccggcgtac 540 

ctcctacaaa ctccgccgcc ggttccattg ggtctccgga tctgtcctct gctttgtacg 600

gggtcgatgg gtggggagct ccttatttct ccgttaactc taacggagat atctccgtcc 660 

gaccacatgg tacggacaca ctcccccacc aggaaattga ccttctcaag gtcgtgaaaa 720

aggcctccga cccgaaaaat tcaggggggc tcgggcttca gctgcctctt gttgttcgct 780

tccctgatgt gctaaaaaac cggttggaat ctctgcaatc ggcttttgat ctcgctgttc 840 

attcccaggg ctatggggcc cactaccaag gtgtttatcc cgtgaaatgc aatcaagaca 900 

ggttcgtggt ggaagatatt gtcaaattcg ggtcgtcatt ccggttcggg ttggaagctg 960

ggtctaaacc cgagctcctg ttagccatga gctgtctctg caggggcagt gctgagggcc 1020 

ttctcgtttg caatggtttc aaggacgctg agtacatttc gcttgctttg gttgcaagaa 1080 

agctcatgtt aaacactgta attgttcttg aacaagagga ggagcttgac cttgtgattg 1140 

atataagccg taagatggct gttcggcccg taattggact tcgggctaag ctcaggacca 1200 

agcattcagg ccattttgga tccacttctg gagaaaaagg taagtttggg cttacaacga 1260 

cccaaattgt tcgtgtagtg aagaagctgg aagaatccgg aatgctggat tgccttcagt 1320 

tgctgcattt tcacattgga tctcagatcc cttcaacggc gttgcttgct gatggtgttg 1380 

gtgaggctgc tcagatttat tgtgaattaa tccgtcttgg tgcgggtatg aagttcattg 1440

atactggagg tgggctcgga attgattatg atggtactaa atcatgtgat tcagatgtct 1500 

ctgttggcta tggcattcaa gaatacgcct ccacagttgt ccaggcggtt caatatgttt 1560 

gcgaccgtaa gggcgtgaag cacccagtga tttgcagcga aagtggcagg gcaattgttt 1620 

ctcatcactc aattctgatt ttcgaagccg tgtctgcttc tagtcactca tgttcttctt 1680 

cacatctgtc ttctggtggc ctccaatcca tggcggagac gctcaatgaa gatgcccttg 1740 

ctgattaccg caatttatct gctgctgcag ttcgtggaga gtacgagacg tgtgtacttt 1800

actctgatca gttgaaacag agatgtgtgg atcagtttaa agaagggtcc ttgggtattg 1860

aacatcttgc tgctgttgat agcatctgtg attttgtatc aaaggctatg ggggctgctg 1920 

atcctatccg cacttaccat gtgaatctgt caattttcac ttcaattcct gatttttggg 1980

cctttggtca attgtttccg attgttccaa tacaccgttt agatgaaaag cctgcagtaa 2040 

ggggaatatt atcggacttg acttgtgaca gtgatgggaa ggttgataag ttcattggtg 2100 

gcgaatcaag cttgcagctg catgaattgg gaagtaatgg cgatggtggt gggtattatc 2160 

tggggatgtt tttgggtggg gcttatgagg aggcgctcgg aggactccac aacctgtttg 2220 

gtggaccaag cgtggtgcgc gtggtgcaga gcgatagcgc tcacagcttc gccatgtctc 2280 

gctccgtccc tggcccgtcc tgcgcggacg tgctccgagc gatgcagcac gagcccgagc 2340 

tcatgttcga gactctcaag caccgtgcgg aggaattctt ggaacaagaa gaagacaaag 2400 

ggctggccat tgcatctttg gccagcagct tagctcagtc cttccataac atgccttacc 2460 

ttgtggcgcc tgcatcttgc tgcttcactg cagttactgc taacaacggt ggctataact 2520
actattacag tgatgagaat gcagcagatt ctgctacagg ggaggatgag atttggtcct 2580
attgcactgc ttgaagtgtt gtcgtagcat ctccagtttt agtttgtcgt cgaagttgtc 2640 
tgtttttgaa taataccctt agttggtgat gtttttctaa aaaaaaaaaa aaaaa 2695 

<210> 24 
<211> 720 
<212> PRT
<213> Plant 

<400> 24 

Met Pro Ala Leu Gly Cys Cys Val Asp Ala Thr Val Ser Pro Pro Leu
1 5 10 15

Gly Tyr Ala Phe Ser Arg Asp Ser Ser Leu Pro Ala Pro Glu Phe Phe
20 25 30

Thr Ser Gly Val Pro Pro Thr Asn Ser Ala Ala Gly Ser Ile Gly Ser
35 40 45

Pro Asp Leu Ser Ser Ala Leu Tyr Gly Val Asp Gly Trp Gly Ala Pro
50 55 60

Tyr Phe Ser Val Asn Ser Asn Gly Asp Ile Ser Val Arg Pro His Gly
65 70 75 80

Thr Asp Thr Leu Pro His Gln Glu Ile Asp Leu Leu Lys Val Val Lys
85 90 95

Lys Ala Ser Asp Pro Lys Asn Ser Gly Gly Leu Gly Leu Gln Leu Pro
100 105 110

Leu Val Val Arg Phe Pro Asp Val Leu Lys Asn Arg Leu Glu Ser Leu
115 120 125

Gln Ser Ala Phe Asp Leu Ala Val His Ser Gln Gly Tyr Gly Ala His
130 135 140

Tyr Gln Gly Val Tyr Pro Val Lys Cys Asn Gln Asp Arg Phe Val Val
145 150 155 160

Glu Asp Ile Val Lys Phe Gly Ser Ser Phe Arg Phe Gly Leu Glu Ala
165 170 175

Gly Ser Lys Pro Glu Leu Leu Leu Ala Met Ser Cys Leu Cys Arg Gly
180 185 190

Ser Ala Glu Gly Leu Leu Val Cys Asn Gly Phe Lys Asp Ala Glu Tyr
195 200 205

Ile Ser Leu Ala Leu Val Ala Arg Lys Leu Met Leu Asn Thr Val Ile
210 215 220

Val Leu Glu Gln Glu Glu Glu Leu Asp Leu Val Ile Asp Ile Ser Arg
225 230 235 240

Lys Met Ala Val Arg Pro Val Ile Gly Leu Arg Ala Lys Leu Arg Thr
245 250 255

Lys His Ser Gly His Phe Gly Ser Thr Ser Gly Glu Lys Gly Lys Phe
260 265 270

Gly Leu Thr Thr Thr Gln Ile Val Arg Val Val Lys Lys Leu Glu Glu
275 280 285

Ser Gly Met Leu Asp Cys Leu Gln Leu Leu His Phe His Ile Gly Ser
290 295 300

Gln Ile Pro Ser Thr Ala Leu Leu Ala Asp Gly Val Gly Glu Ala Ala
305 310 315 320

Gln Ile Tyr Cys Glu Leu Ile Arg Leu Gly Ala Gly Met Lys Phe Ile
325 330 335

Asp Thr Gly Gly Gly Leu Gly Ile Asp Tyr Asp Gly Thr Lys Ser Cys
340 345 350

Asp Ser Asp Val Ser Val Gly Tyr Gly Ile Gln Glu Tyr Ala Ser Thr
355 360 365

Val Val Gln Ala Val Gln Tyr Val Cys Asp Arg Lys Gly Val Lys His
370 375 380

Pro Val Ile Cys Ser Glu Ser Gly Arg Ala Ile Val Ser His His Ser
385 390 395 400

Ile Leu Ile Phe Glu Ala Val Ser Ala Ser Ser His Ser Cys Ser Ser
405 410 415

Ser His Leu Ser Ser Gly Gly Leu Gln Ser Met Ala Glu Thr Leu Asn
420 425 430

Glu Asp Ala Leu Ala Asp Tyr Arg Asn Leu Ser Ala Ala Ala Val Arg
435 440 445

Gly Glu Tyr Glu Thr Cys Val Leu Tyr Ser Asp Gln Leu Lys Gln Arg
450 455 460

Cys Val Asp Gln Phe Lys Glu Gly Ser Leu Gly Ile Glu His Leu Ala
465 470 475 480

Ala Val Asp Ser Ile Cys Asp Phe Val Ser Lys Ala Met Gly Ala Ala
485 490 495

Asp Pro Ile Arg Thr Tyr His Val Asn Leu Ser Ile Phe Thr Ser Ile
500 505 510

Pro Asp Phe Trp Ala Phe Gly Gln Leu Phe Pro Ile Val Pro Ile His
515 520 525

Arg Leu Asp Glu Lys Pro Ala Val Arg Gly Ile Leu Ser Asp Leu Thr
530 535 540

Cys Asp Ser Asp Gly Lys Val Asp Lys Phe Ile Gly Gly Glu Ser Ser
545 550 555 560

Leu Gln Leu His Glu Leu Gly Ser Asn Gly Asp Gly Gly Gly Tyr Tyr
565 570 575

Leu Gly Met Phe Leu Gly Gly Ala Tyr Glu Glu Ala Leu Gly Gly Leu
580 585 590

His Asn Leu Phe Gly Gly Pro Ser Val Val Arg Val Val Gln Ser Asp
595 600 605

Ser Ala His Ser Phe Ala Met Ser Arg Ser Val Pro Gly Pro Ser Cys
610 615 620

Ala Asp Val Leu Arg Ala Met Gln His Glu Pro Glu Leu Met Phe Glu
625 630 635 640

Thr Leu Lys His Arg Ala Glu Glu Phe Leu Glu Gln Glu Glu Asp Lys
645 650 655

Gly Leu Ala Ile Ala Ser Leu Ala Ser Ser Leu Ala Gln Ser Phe His
660 665 670

Asn Met Pro Tyr Leu Val Ala Pro Ala Ser Cys Cys Phe Thr Ala Val
675 680 685

Thr Ala Asn Asn Gly Gly Tyr Asn Tyr Tyr Tyr Ser Asp Glu Asn Ala
690 695 700

Ala Asp Ser Ala Thr Gly Glu Asp Glu Ile Trp Ser Tyr Cys Thr Ala
705 710 715 720






<210> 25 
<211> 914 
<212> DNA 
<213> Plant 



<400> 25 

aagctcgann ttaancctca ntaaagggaa caaaagctgg taccgnggcc ccccctcgag 60 
gtcgacggta tcgataagct tgattaagct tagtangcac attagcagcg cttgggatga 120 
ttttaggcgc ggcctattcc ctttggctat ataatcgtgt ggttctggga attaaaaccc 180 
gatttcctcc ataaattctc cgatctaaat ggcanagaat tttcatattt ataccttttc 240
ttgttggagt tgtttggatg ggtgtttacc ccaaagtgtt cctgggactg catgcataca 300 
tccgtaagta acttaagtgc aacatggaaa atttcattga gaggaatcag caaaaaaaaa 360
aagcttaatc gaattcctgc agcccggggg atccactagt tctagagcgg ccgccaccgc 420 
ggtggagctc caattcgccc tatagtgagt cgtattacaa ttcactgggc cgtcgtttta 480
caacgtcgtg actgggaaan gcctggcgtt accaacttaa tcgccttgca gcacatcccc 540 
ctttcgccag ctgggcgtaa tagcgaanag gccgcacgat cgccttccca acagttgcgc 600 
acctgatggn gaatgggacn gccctgtanc ngcgcangaa ncgcggcngg tgtggtggta 660 
ccccancgtg acgcnaactt gcaacgccta acgccgcnct tcgcttctcc ttcttctngc 720 
aagttcncgg ctttccgtaa gtccaagcgg gggnccttag gttcgattat gttnggaccc 780
ccccnaaact gttanggtnt gtgatttggc accccaaaac gtttcccttg nctggtcntt 840 
ttaaangcct ntcaacngaa accaccatcg gnacttgtta aggntncctt gcntgnaaaa 900
nngtaaatta nntn 914

<210> 26 
<211> 829 
<212> DNA 
<213> Plant 

<400> 26 

agcaagctcg atatcgccct cactaaaggg aacaaaaact ggtaccgggc cccccctcga 60 
ggtcgacggt atcgataagc ttgattaanc tttttttttt tgaactacac aagggaattt 120
cttctcctna gtaacacatg agaataatta gtgcaataaa ttacaagagg aacattgcag 180 
ttggatttaa gaatctgcgc tggggaattt agcctcaata tttgctacaa ccgtacagat 240 
ttcactgcat tcatgaacga tagtatccgt gacacatcct tttggatgcc gtcctgtcca 300 
catatgccac tactcacatc cactccattg ggtttaagtt gcagaaagag cttcacaaac 360 
attctccggg ttaattcctc ctgccaagag ccacccatgt ttgcttctaa tgcgcggcag 420 
cttaaactga acccagttga atcctttgcc actgccacct tttgcactat ccactagaac 480 
ccaatcaacc agagaagact cctcatcaga aatatagttc aaaaggctcc tcttcatttg 540 
catgaagtac gtatattaca cgtttttccc tgaccaaagc ttaatcgaat tctgcagccc 600 
gggggatcng gnattctaga gcggcgccac gcggtggagc tccaatcgcc taaatgancn 660 
ataaaatcac tggccgtcgt ttanacgncn ggacgggaaa cctgggtacc aacttaatcg 720 
cctgnagcna tccccttcnc agcggngtan acgaaaggcc gncgattgcc tccanattgc 780 
cacnggatgg aanggacncc gtncgganga acngggggnn ggggtaccn 829